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By 
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Chairman:  MeJay  Ghosh 
Major  Department:  Statistics 

Hierarchical  Bayes  (HB)  estimators  of  the  variance  and  the  variance  ratio  in  nor- 

msd  models  are  developed  with  the  objective  of  dominating  the  corresponding  best 

multiples  of  the  sample  variance  or  the  ratio  of  the  two  sample  variances.  We  also 

develop  highest  posterior  density  (HPD)  credible  intervals  for  the  variances  in  some 

cues. 

First,  point  and  interval  estimation  of  the  error  variance,  say  a^,  is  considered 
in  the  fixed-effects  balanced  one-way  normal  ANOVA  model.  Two  general  classes  of 
HB  estimators  of  a^"  are  proposed  which  dominate  the  best  multiple  estimator  5** 
(5  being  the  error  sum  of  squares)  under  the  entropy  loss.  The  dominance  of  the 
proposed  estimators  over  the  best  multiple  estimator  is  proved.  We  then  find  two 
classes  of  HPD  credible  intervals  based  on  the  two  classes  of  priors  for  using  all 
the  cell  means  and  the  error  sum  of  squares.  Such  intervals  have  smaller  length  and 
greater  frequentist  coverage  probability  than  the  usual  confidence  intervals  based  on 
5  only.  Also,  other  classes  of  HPD  credible  intervals  are  provided  which  have  the 
same  length  but  higher  coverage  probability  than  the  usual  intervals. 

In  Chapter  3,  HB  estimators  of  the  disturbance  variance  are  developed  in  two 
nested  regression  models.  Risk  dominance  of  the  HB  estimators  over  the  best  multiple 

vi 


estimators  is  proved  and  expressions  for  the  risk  dominance  are  given  for  both  models. 
Also,  HPD  credible  intervals  are  provided  for  both  models. 

In  the  next  chapter,  a  class  of  HB  estimators  is  developed  for  the  ratio  of  variances 
from  independent  random  samples  from  two  normal  distributions.  Using  an  extension 
of  the  class  of  hierarchical  priors  for  the  one-sample  case,  two  classes  of  HB  estimators 
are  developed.  We  prove  the  risk  dominance  of  these  estimators  and  provide  sufficient 
conditions  under  which  a  subclass  of  these  estimators  dominates  the  best  multiple 
estimator.  For  one  of  these  subclasses,  the  expression  for  the  risk  improvement  is 
also  provided. 

Numerical  calculations  of  the  percentage  risk  improvement  of  the  proposed  HB 
estimators  over  the  best  multiple  estimators  in  all  these  situations  indicate  that  the 
risk  improvement  can  often  be  quite  substantial. 
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CHAPTER  1 
INTRODUCTION 

1.1    Literature  Review 

Decision  theoretic  estimation  in  normal  linear  models  has  been  an  active  area 
of  statistical  research  for  a  long  time.  Ever  since  the  appearance  of  Stein's  (1956) 
seminal  paper  on  the  inadmissibility  of  the  normal  mean  in  three  or  higher  dimensions, 
considerable  amount  of  research  effort  has  been  spent  on  simultaneous  estimation  of 
the  means  or  the  regression  coefficients  in  general  linear  models.  Useful  reviews  of 
the  topic  have  appeared  in  Brown  (1986)  and  Ghosh  (1992a). 

In  contrast  to  the  meaji  estimation  problem,  literature  on  decision  theoretic  es- 
timation of  the  variance  or  variance  components  in  normal  linear  models  is  not  so 
rich.  It  was  known  for  a  long  time  that  neither  the  MLE  nor  the  UMVUE  was  the 
best  multiple  of  the  error  sum  of  squares  when  estimating  the  normal  variance,  say 
(7^.  The  best  multiple  of  the  error  sum  of  squares  was  also  the  best  equivariant  es- 
timator under  a  group  of  location- scale  transformations.  However,  in  yet  another 
fundamental  work  of  Stein  (1964),  it  was  shown  that  this  best  equivariant  estimator 
was  inadmissible  under  any  quadratic  loss, 

LQ{a,a^)  =  Q{a'){a-a')\  (1.1.1) 

where  Q{cr^)  >  0.  The  estimators  that  were  developed  by  Stein  (1964),  which  domi- 
nated the  usual  best  multiple  estimator,  were  based  upon  the  acceptance  or  rejection 
of  the  null  hypothesis  that  the  population  mean  was  equal  to  a  specified  value.  Al- 
though these  estimators,  often  labeled  as  preliminary  testimators,  dominated  the 
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usual  estimator  by  virtue  of  having  lower  risk  under  any  quadratic  loss,  they  were 
non-smooth  and,  therefore,  were  not  admissible  under  any  arbitrary  quadratic  loss 
(see  e.g.  Sacks,  1963).  Stein  (1964)  conjectured  that,  in  the  one-way  analysis  of 
variance  (ANOVA)  situation,  within  a  class  of  estimators  equivariant  only  under  a 
transformation  of  scale,  a  substantial  improvement  may  be  obtained  by  a  Bayesian 
solution  when  the  ratio  of  the  number  of  unknown  means  to  the  number  of  obser- 
vations is  sufficiently  large.  This  conjecture  has  been  supported  in  the  numerical 
findings  of  Ghosh  (1992b). 

Brown  ( 1968)  showed  that  the  usual  best  multiple  estimator  of  the  normal  variance 
was  inadmissible  under  a  wider  class  of  loss  functions.  This  paper  offered  valuable 
insight  into  the  statistical  problems  involving  unknown  location  and  scale  pajameters, 
while  not  attempting  to  propose  any  actu2Ll  estimators  of  the  variance  which  improved 
substantially  on  the  usual  estimator.  Brown  (1968)  also  argued  against  the  use  of 
quadratic  loss  for  estimating  any  power  of  the  scale  parameter,  in  particular  the 
variance,  one  of  his  main  reasons  being  that  the  best  equivariant  estimator  of  the 
variance  is  biased.  Properties  of  some  alternative  loss  functions  and  the  corresponding 
best  equivariant  estimators  were  also  provided  in  this  paper.  The  main  result  of 
Brown's  paper  showed  that  there  exists  a  unique  loss  function  for  the  estimation  of 
cr^**,  (a  >  0),  namely  the  entropy  loss, 


for  which  the  best  equivariant  estimator  under  a  group  of  location-scale  transforma- 
tions is  always  unbiased.  The  entropy  loss  can  be  interpreted  as  the  Kullback-Leibler 
divergence  measure  between  two  normal  distributions,  one  indexed  by  the  parameter 
a,  and  the  other  by  as  the  population  variance.  But,  the  unbiased  best  equivariant 
estimator  of  a^"  is  also  inadmissible. 


(1.1.2) 


Brewster  and  Zidek  (1974)  obtained  a  hierarchical  Bayes  (HB)  estimator  of  a^" 
by  extending  the  results  of  Stein  (1964).  The  resulting  HB  estimator  was  minimcix 
and  admissible  within  the  class  of  scale-equivariant  estimators  for  the  univariate  case. 
Proskin  (1985)  proved  the  admissibility  of  Brewster-Zidek  estimators  within  the  cleiss 
of  all  estimators.  Strawderman  (1974),  using  a  similar  technique,  developed  a  class 
of  minimeix  estimators  and  made  an  attempt  to  prove  the  risk  dominance  of  an  HB 
estimator  over  the  best  multiple  estimator  of  the  variance  in  the  one-sample  case. 
However,  as  noted  in  Ghosh  (1992b),  Strawderman's  result  was  incorrect. 

It  is  well-known  that  in  the  one-way  analysis  of  variance  situation,  under  the 
group  of  location-scale  transformations,  the  best  equivariant  estimator  of  under 
the  loss 

L{a,c^'')  =  u-^^{a-a^''f  (1.1.3) 

is  given  by  c„5°  where  c«  =  2-°V{\{{n-l)k^2a))/V{\{{n-\)k^-Aa)),  where  S  is 
the  error  sum  of  squeires.  Also,  this  estimator  is  the  constant  risk  minimax  estimator 
of  (7^°  under  the  same  loss.  In  addition,  is  the  best  multiple  of  8°  under  any 
arbitrary  quadratic  loss 

L<^(a,a2'')  =  g(a2«)(a-(72°)^  (1.1.4) 

where  C?(cr2<»)  >  0. 

However,  within  the  bigger  class  of  estimators  equivariant  only  under  scale  trans- 
formations, CaS"  is  an  inadmissible  estimator  of  (t^"  under  every  single  Lq  loss  given 
in  (1.1.4).  Indeed,  Stein-type  estimators  dominating  Ca5°  for  a  general  a  (>  0)  were 
produced  by  Gelfand  and  Dey  (1988a),  when  the  number  of  cells  equals  one. 

Ghosh  (1992b)  derived  a  class  of  hierarchical  Bayes  (HB)  estimators  of  cr^'*  (a  >  0) 
under  the  loss  (1.1.3).  Such  estimators,  unlike  the  Stein-type  estimators,  are  smooth. 


r- 
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A  subclass  of  these  HB  estimators  dominate  CaS°  under  any  arbitrary  quadratic  loss 
given  in  (1.1.4)  in  the  sense  of  having  smaller  frequentist  risk.  The  estimator  proposed 
earlier  by  Brewster  and  Zidek  (1974)  when  a  =  1  is  a  member  of  this  subclass  of 
these  estimators.  The  numerical  findings  of  Ghosh  (when  a  =  1 )  indicated  that  often 
these  risk  improvements  could  be  quite  substantial.  Ghosh  (1992b)  used  a  direct 
argument  to  prove  the  risk  dominance  of  the  HB  estimators  over  the  best  multiple 
estimator,  whereas  Kubokawa  (1991)  used  a  definite  integral  technique  to  develop 
a  class  of  scale-equivariant  estimators  (not  necessarily  Bayes)  which  dominated  the 
best  multiple  of  the  error  sum  of  squares. 

All  of  the  articles  mentioned  above  deal  with  the  point  estimation  of  the  variance 
or  the  variance  component.  Tate  and  Klett  (1959)  provided  the  expression  for  the 
shortest  confidence  interval  of  that  depended  only  on  the  sample  variance,  S. 
Cohen  (1972)  allowed  the  presence  of  unknown  means  and  used  the  estimators  similar 
to  those  given  by  Brown  (1968)  to  construct  confidence  intervals  with  the  same  length 
as  those  given  by  Tate  and  Klett  (1959)  and  higher  coverage  probability.  Using 
the  estimators  developed  by  Brewster  and  Zidek  (1974),  Shorrock  (1990)  developed 
a  confidence  interval  that  improved  upon  the  interval  obtained  by  Cohen  (1972). 
Maatta  and  Casella  (1987)  discussed  the  conditional  properties  of  the  confidence 
intervals  developed  till  then.  Nagata  (1989)  developed  confidence  intervals  using 
Stein-type  testimators  and  provided  numerical  studies  of  the  improvements  in  both 
length  and  coverage  probability.  Maatta  and  Casella  (1990)  provided  a  very  good 
review  of  the  confidence  procedures  for  the  estimation  of  the  variance  from  a  normal 
population. 


1.2    Overview  of  this  Manuscript 

In  this  dissertation,  we  develop  hierarchical  Bayes  estimators  for  the  variance  in 
a  variety  of  different  situations.  We  also  develop  highest  posterior  density  (HPD) 
credible  intervals  for  the  variances  in  some  cases.  Berger  (1985)  contains  a  discussion 
on  HPD  credible  intervals.  The  hierarchical  Bayesian  models  that  are  considered  here 
can  be  regarded  as  extensions  of  the  hierarchical  Bayes  ideas  of  Lindley  and  Smith 
(1972),  and  are  available  for  example  in  Ghosh  (1992a)  and  Strawderman  (1974). 

In  Chapter  2,  we  consider  the  estimation  of  a^",  where  a'  is  the  error  variance 
in  a  fixed-effects  balanced  one-way  ANOVA.  Ghosh  (1992b)  developed  a  class  of  HB 
estimators  which  dominate  the  best  multiple  estimator  of  the  sample  variance  under 
every  single  loss  of  the  form  given  in  (1.1.4).  We  consider  here  estimation  of  cr^"  under 
the  entropy  loss  given  in  (1.1.2).  Under  this  loss,  the  best  equivariant  estimator  of 
a^''  is  given  by  its  UMVUE,  d„S°  where 

r(l((„-l)»:  +  2a))- 

Two  classes  of  hierarchical  Bayes  priors  are  considered  in  Section  2.2.  Similar  priors 
were  used  by  Ghosh  (1992a)  and  Morris  (1983)  for  the  simultaneous  estimation  of 
means.  Conditions  are  also  provided  under  which  a  subclass  of  these  estimators  has 
smaller  risk  thein  daS".  We  use  a  modified  version  of  a  theorem  by  Kubokawa  (1991 ) 
to  prove  the  risk  dominance  of  the  HB  estimators  over  the  best  multiple  estimator. 
An  expression  for  the  risk  improvement  is  obtained  for  each  of  these  two  classes  of  HB 
estimators  when  a  =  1.  We  provide  some  numerical  calculations  of  the  percent  risk 
improvement  and  show  that  often  the  risk  improvement  can  be  quite  large.  The  last 
section  of  the  chapter  deals  with  the  derivation  of  HPD  credible  intervals.  We  obtain 
expressions  for  credible  intervals  which  are  shorter  and  have  higher  probability  of 
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coverage  than  the  minimum  length  and  shortest  unbiased  confidence  intervals  based 
on  the  sample  variance.  5,  only  developed  by  Tate  and  Klett  (1959)  and  discussed  in 
Maatta  and  Casella  (1990).  Also,  we  develop  credible  intervals  using  the  formulation 
of  Cohen(1972)  and  Shorrock  (1982,  1990).  Although  this  latter  class  of  HPD  credible 
intervals  have  the  same  length  as  the  ones  based  on  S  only,  they  have  higher  coverage 
probability. 

In  Chapter  3,  we  develop  HB  estimators  of  the  disturbajice  variance  in  two  nested 
regression  models.  Gelfand  and  Dey  (1988a)  developed  Stein- type  testimators  for 
the  disturbance  variance,  but  their  estimators  were  not  smooth.  Risk  dominance  of 
the  HB  estimators  over  the  best  multiple  estimator  is  proved  for  both  the  full  and 
reduced  regression  models.  For  the  reduced  model,  we  use  a  class  of  hierarchical 
priors  similar  to  the  one  used  by  Ghosh,  Lee  and  Littell  (1990)  for  the  estimation  of 
the  response  function.  Also,  HPD  credible  procedures  are  provided  for  both  the  full 
and  the  reduced  models. 

In  Chapter  4,  our  objective  is  to  develop  a  class  of  HB  estimators  for  the  ratio  of 
variances  from  two  independent  random  samples.  Loh  (1986)  provided  some  adap- 
tive versions  of  the  maximum-likelihood  (ML)  and  restricted  maximum-likelihood 
(REML)  estimators  of  the  variance  ratio.  Gelfand  and  Dey  (1988b)  obtained  a  pair 
of  non-smooth  estimators  for  the  variance  ratio  which  dominated  the  best  multiple 
estimator  under  any  arbitrary  quadratic  loss.  Using  an  extension  of  the  class  of  hi- 
erarchical priors  for  the  one-sample  case,  we  develop  two  classes  of  HB  estimators 
for  the  variance  ratio.  We  prove  the  risk  dominance  of  these  estimators  by  usmg  a 
two-sample  version  of  the  direct  integral  technique  of  Kubokawa  (1991).  We  provide 
sufficient  conditions  under  which  these  classes  of  estimators  dominate  the  best  mul- 
tiple estimator.  For  one  of  these  subclasses,  the  expression  for  the  risk  improvement 
is  also  provided. 


Before  concluding  this  chapter,  we  provide  in  the  next  section  certain  properties  of 
chi-squared  distributions  which  are  used  repeatedly  in  finding  optimal  Bayes  credible 
intervals  and  in  proving  the  frequentist  properties. 


1.3    Properties  of  Chi-squared  distribution 

In  this  section,  we  provide  a  series  of  lemmas,  which  exhibit  some  properties  of 
the  chi-square  random  variable. 

Lemma  1.3.1  Let  x  <  y  and  m  <  n.  Then, 

P{xi<x)  P(xl<y)-P{xl<y)  P(xl<x)>o 

Proof. 


Jo  Jo  r(f)r(f)  '  ' 

—  —  1  -—1 

^      *     •»  * 


rx/2    ry/2  ~  3       yl  '  ' 

Jo   Jo  r(f)r(=)  '  ' 
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Jx/2  Jo 


y/2    fx/2  g-(*l+*s) 


x/2  Jo  T{r^)m) 


n — m  n— m 


dz\dz2 


>  0 


(1.3.1) 


Lemma  1.3.2  Let  /i,(x)  denote  the  pdf  of  f  >  2  j  and  let  F^{x)  =  P(x^  <  x] 
Then  fv{x)^ Fu{x)  is  [  in  x. 

Proof.       Integrating  by  parts, 


F^{x)  =  F,_2(x)  -  2^(x) 


(1.3.2) 


Hence, 


Ux)/F.{x)  =  \  [F._2(x)/F.(x)  -  1] 


(1.3.3) 


Then,  using  Lemma  1.3.1,  and  the  expression  given  in  (1.3.3),  it  follows  that  the  ratio 
fi,{x)/F„{x)  is  I  in  x. 

Lemma  1.3.3  Let  /^(x)  denote  the  pdf  of  a  xl  random  variaWe  and  F^(x)  =  P{xl  < 
x).  Then  xf^{x)^F^{x)  is  i  in  x. 

Proof.      Let  h{x)  =  x/,(x)/F,(x).  Then, 


h{x)  = 


3.g-x/2ji-/2-l 


F,(x)  [  2''/2r(i//2) 


^fi'+2ix) 

F.{x) 


V 

2 


1  - 


F.^2{x) 
F^x) 


using  (1.3.2). 


(1.3.4) 


Since,  F^+2ix)/ F^{x)  is  t  in  x  from  Lemma  1.3.1,  the  result  follows. 


Lemma  1.3.4  Let  F^x)  =  P{xl  <  x).  Then,  for  0  <  toi  <  wj,  F^{w2z)/ F^w^z)  is 
i  in  z. 


Proof.  Let  g{z)  =  F^{w2z)^ Fu{wiz).  Then,  differentiating  with  respect  to  2,  we 
have 


W^Ujw^z)  _  Wif^{wi,z)Ft,{w2z) 

F^{w^z)  {F,{wrz})^ 


FJW2Z) 

\:^iz) 


{w2Z)fy{W2Z)  _  {wiZ)fUwiz) 

FUW2Z)  F^{wiz} 


<  0, 


using  Lemma  1.3.3. 


(1.3.5) 


Lemma  1.3.5  Let  F,(i;  A)  =  P  (xl,x  <  x)  and  F^{x)  =  P  (xl  <  x).  Then,  for  X  >  0, 

F.(x;A) 


Proof.       For  A  >  0,  one  has 


F.{x) 


T     in  X. 


t     in  X. 


Therefore,  for  xi  <  X2, 


P  (xl,,  <  X2)  P  {xl  <x,)-P  (xl,  <  X,)  P  {xl  <  X2) 


Jo  Jo  ^xi^y^fxiS'^^yd^ 
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0 


(1.3.6) 


CHAPTER  2 
ESTIMATION  OF  THE  NORMAL  VARIANCE 


2.1  Introduction 


Consider  a  fixed-effects  balanced  one-way  analysis  of  variance  (ANOVA)  model 
with  homoscedastic  errors.  As  mentioned  in  Section  1.2,  a  decision-theoretic  approach 
is  taken  in  this  chapter  towards  point  and  interval  estimation  of  cr^'*  (a  >  0,  known), 
where  denotes  the  error  variance.  We  denote  by  S,  k  and  n  the  error  sum  of 
squares,  the  number  of  cells  and  the  number  of  observations  per  cell  respectively. 
We  consider  point  estimation  under  the  entropy  loss  given  in  (1.1.2).  It  is  well- 
known  that  under  the  group  of  location-scale  transformations,  the  best  equivariant 
estimator  under  the  loss  Lia,a^'')  =  c7"''°(a  -  a^'^f  is  given  by  CaS"  where  = 
2-"r(i((n-l)fc  +  2a))/r(|((n-l)fc  +  4a)).  Also,  this  estimator  is  the  constant  risk 
minimax  estimator  of  a^"  under  the  same  loss.  In  addition,  c  is  the  best  multiple  of 
S"  under  any  arbitrary  quadratic  loss  of  the  form  given  in  (1.1.4). 

However,  within  the  bigger  class  of  estimators  equivariant  only  under  scale  trans- 
formations, CaS°'  is  an  inadmissible  estimator  of  (7^°'  under  every  single  Lq  loss  given 
in  (1.1.4).  Under  this  loss,  the  best  equivariant  estimator  of  a^"  is  given  by  its 
UMVUE,  d^S"  where 

Ti\{n-l]k) 

-  r(l((n-l)^^  +  2a))  (2.1.1) 
We  aim  at  finding  two  classes  of  HB  estimators  of  a-°'  under  the  loss  (1.1.2) 
which  dominates  a-°.  The  derivation  of  such  HB  estimators  is  given  in  Section  2.2. 
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The  dominance  of  such  estimators  over  d„S°  is  proved  by  appealing  to  a  theorem  of 
Kubokawa  (1991). 

In  Section  2.3,  we  find  the  Highest  Posterior  Density  (HPD)  credible  intervals  for 
(7^"  (  or  o-^  )  under  two  classes  of  hierarchical  priors.  We  then  demonstrate  that  such 
intervals  have  nice  frequentist  properties  as  well  in  the  sense  that  they  have  smaller 
length  and  greater  coverage  probability  (in  the  frequentist  sense)  than  either  the 
minimum  length  or  the  shortest  unbiased  confidence  intervals  based  only  on  5,  the 
error  sum  of  squares.  Also,  two  more  classes  of  HPD  credible  intervals  are  considered, 
one  of  which  includes  as  its  members  the  ones  considered  earlier  by  Shorrock  (1982. 
1990).  These  latter  intervals  have  the  same  length  but  higher  coverage  probability 
thaji  the  usual  intervals  based  only  on  S.  An  interesting  review  of  interval  estimation 
of  (7^  is  given  in  Maatta  and  Casella  (1990). 

In  Section  2.4,  some  numerical  calculations  are  provided  to  indicate  the  extent  of 
risk  improvement  of  such  estimators  over  daS".  Also,  some  simulation  studies  are 
performed  to  show  that  the  proposed  classes  of  HPD  credible  intervals  have  a  higher 
coverage  probability  than  the  minimum  length  and  shortest  unbiased  confidence  in- 
tervals based  on  S  only. 

2.2    The  HB  Model 

Consider  the  following  balanced  fixed  effects  one-way  ANOVA  model 

=  ^.  +  e,j     (j  =  1, . . . ,  n;   t  =  1. . . . ,  k), 

where  the  0,  are  unknown  fixed  effects,  and  the  e,j  are  i.i.d.  N(Q,<t^).  The  minimal 
sufficient  statistic  is  (Fi, . . . ,  Yl,  5),  where  Y,  =  n'^H'J^^Y.j  (i  =  1, . . . ,  k),  and  5  = 
S*^iS;^i(r.j  -  Y,)\  We  use  the  notation  Y,  for  Y    d  =  l,...,k).  Also,  let  Y  = 
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(Fi, . . . ,  YkV,  9  =  (6-1,.. . ,  9k)^.  We  consider  the  estimation  of  cr^"  under  the  entropy 
loss  given  in  (1.1.2). 

We  first  consider  a  hierarchical  Bayes  model  which  is  constructed  by  using  in- 
dependent normal  priors  with  known  means,  say  zeroes,  and  a  common  unknown 
variance  at  the  first  stage  for  the  k  cell  means.  In  the  second  stage,  diffuse  priors 
are  assigned  to  the  first  stage  prior  variance  and  the  error  variance,  a^.  The  models 
considered  in  Brewster  and  Zidek  (1974)  and  Strawderman  (1971,  1974)  are  members 
of  this  class.  Such  priors  are  used  for  simultaneous  estimation  of  means  by  Morris 
(1983)  and  Ghosh  (1992a).  The  hierarchical  model  is  given  in  (I)  -  (III)  below,  where 
R  =  a-\ 

(I)  Conditional  on  0  =  0,  R  =  r  and  A  =  A,  y  and  5  are  mutually  inde- 
pendently with  Y  ~  N{e,  (nr)-^  Jfc)  and  S  ~  r-'^x(n-i)k^  ("  ^  2); 

(II)  Conditional  on     =  r  and  A  =  A,  0  ~  iV(0,  (Ar)-ilfc); 

(III)  R  and  A  are  marginally  independently  distributed,  where  R  has  pdf 
/fl(r)  a  r-5»  {0  <  a  <  nk  +  2)  while  A  has  pdf  /a(A)  oc  A-J**  (n  + 
A)-5(*-^). 

Writing  U  =  it  is  shown  in  Ghosh  (1992b)  that  a  version  of  Jeffreys'  prior 
based  on  (I)  and  (II)  only  is  a  member  of  the  above  class  of  priors.  From  (I)  -  (III) 
one  has 

(i)  conditional  onY  =  y,S  =  s,R  =  r  and  U  =  u, 

6>~iV((l-u)y,  {rnr'h); 

(ii)  conditional  onY  =  y,S  =  s  and  U  =  u, 

R  ~  Gamma  [^(5  -f  ut),  -{nk  -  a  +  2)]  , 
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where  t  =  nSjLij/,-  •    [        ^^Y  ^  ~  Gammaia,  /5)  if  it  has  pdf 

fz{z)  oc  exp(— az)z'^"^/(o,oo)(2),  a  >  0,  ^  >  0.  and  /  is  the  usual  indicator 
function.  ] 

(iii)  conditional  on  Y"  =  y  and  5  =  5,  the  pdf  of  U  is  given  by 

Mu\y,s)  ~  u5('=-''Hl  +  uti;)-i("''-+^»/(o.i)(u),  (2.2.1) 
where  w  =  t/s. 

For  the  loss  (1.1.2),  the  Bayes  estimator  of  a-""  =  i?-"*  is  given  by  [E{R°  \Y ,  S)]-\ 
Now,  from  (i)  -  (iii),  we  have 

E.iR-\u^y,s)    =    ra(n^---  +  2  +  2a)) 

'      ^'    ^  ^(i(n^'-a  +  2))       V    2  / 

r(|(nA:  -  g  +  2  +  2a))  Z^' 


A(nfc-a  +  2  +  2a)) 
r(l(n^^-a  +  2))  Uj 


Therefore,  under  the  above  loss,  the  hierarchical  Bayes  estimator  of  a^"  (a  >  0) 
is  given  by 


E{R'-\Y,S) 

/5N°     r(|(nfc-a  +  2))        ,  , 


■2/    Y[\{nk-a^2  +  2oc)) 
=        ^a,6,a(ty),     say.  (2.2.2) 

We  appeal  to  the  following  version  of  Theorem  2.1  of  Kubokawa  (1991)  which 
provides  sufficient  conditions  under  which  e^^^(y .  5)  dominates  the  best  equivariant 
estimator  eo,c,(S)  =  d„S°  of  a^"  under  the  loss  (1.1.2). 

Theorem  2.2.1  Suppose 

Caj  (f>a,b.a{w)     T  in  w,  and  lim  (^a,bAw)  =  d^; 


F  • 


90a('W)  =  :  1  

Then  e^^„(y,  5)  dominates  eo^a{S)  under  the  loss  (1.1.2),  i.e., 


r2a 


r2a 


+  In 


r2a 


for  all  e,  a^. 
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<  0 


Note  that  <^o,a(if )  can  be  simplified  further  as 

/o~  Jo    exp(-i5(l 55"''-i  dxds 


/o~  r  exp(-i3(l  +  x))x5(*-2)55"^+°-i  dxds 


=  2- 


r(f )    lo  x5(fc-^)(i  +  x)-^<fx 
x'^c-'^ii  +  xy'^dx 


=   h.2,cM.     using  (2.2.1)  and  (2.2.2) 


(2.2.3) 


We  confine  our  attention  to  the  subclass  of  HB  estimators  e^^„(l^,  5)  of  a^".  The 
next  theorem  provides  conditions  for  a  under  which  (i>a,a,a{'^)  satisfies  the  conditions 
of  Theorem  2.2.1. 

Tiieorem  2.2.2  Let2<a<  k  +  2.  Then,  (t>a,a,a{w)  satisfies  conditions  (a)  and  (b)  of 
Theorem  2.2.1. 


Proof  We  first  prove  condition  (b).  For  every  0  <  a  <  ^  +  2,  using  (2.2.1).  we 
simplify  (^a,a.a{'^)  given  in  (2.2.2)  as 


o-c     n\{nk  -  g  +  2))  Ji'-^m  +  uti;)-|<»^-°^^)(fu 

r(i(nfc  -  a  +  2  +  2a))  /J  u5('=-»)(i  +  uw)-^^''''-''^^+^''Uu 
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=  2" 


T{i{nk-a  +  2)) 
T{\{nk  -  a  +  2  +  2a)) 


=  2 


^  rlf_uu^Ni(fc-a)(_!_U((n-l)fc-2+2a)  u^^^ 

T(i{nk-a  +  2))  Jo"a(''-°Hl-z)'^^"-^"-''^^ 
T{\{nk  -a  +  2  +  2a))     ^^(''-'(l  -  z)5(("-^>''-'+'°>(iz 


(  Putting  z  = 


1  +  uw 


and  V  = 


w 


1  +  iv 


=  2" 


r(i(nA:  -  a  +  2  +  2a))  p  ^Beta{!^,^^^^^f^)  <  v] 


X 


=  2' 


r(i(n-l)fc)  P 

[5e<a(''-f^i^)  <  u] 

r(i(n-  l)fc  +  2a)p 

Beta{!^,i^^^l^)<v 

where  Beta{m,p)  stands  for  a  beta- variable  with  parameters  m  and  p. 
Since  <i>o.a{w)  =  (i>2,2,a{w),  one  has 


<^a.a.a(u^) 

•^O.al^'') 


P  [5e/a(i(fc  -a +  2),  |(n  -  1)A;)  <  v 


Beta(]{k  -  a  +  2),  ^((n  -  l)k  +  2a))  <  v 
Beta(\k,\{n  -  l)k)  <  v 


2 
P 


(2.2.4) 


(2.2.5) 


P  [Betaip,  i((n  -  l)k  +  2a))  <  v 

The  following  lemma,  proved  in  Ghosh  (1992b),  shows  that  <?io,a,a("')  satisfies  (b) 
of  Theorem  2.2.1  for  2  <  a  <  /t  +  2. 

Lemma  2.2.1  If  0  <  pi  <  p2,  then  for  every  0  <  v  <  I, 

P[Beta{m,pi)  <  v)] 


P[Beta{m,p2)  <  v)\ 


is  I   in  m. 
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To  verify  condition  (a)  of  Theorem  2.2.1,  note  that  v  =  so,  as  u;  —  oo, 

V  ~*  I.  From  (2.2.4),  we  have 

lim  0a,aAw)  =  2    :=rTr  ,  o  ^  =  ^«  (2.2.6) 

To  show  that  <^a,a,a(t^)  is  t  in  w,  that  is,  t  in  v,  define  a  new  pdf, 

Mz)  DC  (1  -  2)^(("-i)'=-2)/(o.„,(z),  (2.2.7) 

Then,  for  0  <  u  <  v',  ^  «  T    in       Since  <Paaa{w)  oc  E{1  -  Zy 

(expectation  being  taken  here  with  respect  to  the  pdf  given  in  (2.2.7)),  it  is  t  in  u, 
that  is  t  in  w. 

We  now  consider  the  second  hierarchical  model.  In  this  case,  A;  >  2.  The  basic 
difference  between  this  model  and  the  previous  one  is  that,  unlike  the  previous  case, 
the  first  stage  prior  mean  is  taken  as  M ,  and  a  diffuse  prior  is  assigned  to  M  as  well 
in  the  second  stage.  The  model  is  described  below. 

(I)'  Conditioned  on  0  =  9,  M  =  m,  R  =  r  and  A  =  X,Y  and  5  are  mutually 
independently  with  Y  ~  N{0,  {nr)-'^Ik)  and  5  ~  r-\x(„_i)fc,  (n  >  2); 

(II)  '  Conditional  on  M  =  m,     =  r  and  A  =  A,  0  ~  N{ml,  {Xr)-'^Ik); 

(III)  '  M,  R  and  A  are  marginally  independent,  with  M  ~  Uniform{ -oo,oo), 

fnir)  oc  r-i"  (0  <  a  <  nk+1)  and  /a(A)  «  A-5''(n+A)-5(*-*)  (b  <  k  +  1). 

Again,  writing  U  =         we  have  from  (I)'  -  (III)', 
(i)'  conditional  on  Y  =  y,  S  =  s,  R  =  r  and  U  =  u, 

0  -  iV  ((1  -  u)y  +  uylk,  {nr)-\{l  -  u)!^  +  uk-'Ull))  ; 
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(ii)  '  conditional  onY  =  y,S  =  s  and  U  =  u, 

R  ~  Gamma  (^2^^  2^^^^  ~    +  1)^  > 

(iii)  '  conditional  on  Y"  =  y  and  5  =  5,  the  pdf  of  U  is  given  by 

Mu\y,s)  ~  J^'-'-'Hl  +  uu;-)-5<"'»-»+i)/(o,i,(u),  (2.2.8) 

where  w*  =  nE*Lj(i/,  —  y)^/^  and  y  =  fc~^SjLi!/,. 

The  HB  estimator  of  (7'°'  ba^ed  on  (i)'  -  (iii)'  is  given  by 

1 


E{R''\Y,S) 


/5n«      T(\{nk-a  +  l)) 

V2/    r  i(nifc-a  +  l+2a))  ^     I    '  1 


-1 


=   -5"  <^:,fc.a(^)'    say  (2.2.9) 

Using  arguments  similar  to  those  in  Theorems  2.2.1  and  2.2.2,  we  now  have  the 
following  theorem. 

Theorem  2.2.3  Under  the  loss  (1.1.2),  e^^  JY,  S)  dominates  eo,<,(5)  for  every  a  € 
[2,A;  +  1). 

Now,  to  do  some  numerical  calculations,  expressions  for  the  risk  improvements  of 
for  0  <  a  <  A:  +  2  and  el^  .,{Y,S)  for  0  <  a  <  A:  4-  1  over  eo,i(5)  under 
the  loss  (1.1.2)  for  different  values  of  n,  k  and  a  are  required. 
Note  that  for  a  =  1.  e^^i(r,  5)  simphfies  to 

^HB       S)    =    1  nlink  -a +  2)     z^(fc-°)(l  -  z  )t(i-^)''-Vdz 
^'"''^    '  2r(i(nA;-a  +  4)   j^^  ,^,i^-<^)(l  -  z)'.(r.-i)k)^^ 
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1 


nk  —  a  +  2 


1 


nk  —  a  +  2 


1  + 


k  -  a  +  2 
(n  -  l)k 


2  25(fc-<'+2)(l  _  ^)i(n-l)fc 

(n  -  l)k  -  z)^(("-i)'=)rf2 


(n  - 


where  4>a{V) 


[1  -  <^a(V)] 


(2.2.10) 


nk-a  +  2  jv^^(^^_^^i^^^ 
The  next  theorem  provides  an  expression  for  the  risk  difference  of  eo,\{S)  and 


«a,a,i{Y,S)  under  the  loss  (1.1.2). 

Theorem  2.2.4  Consider  the  model  under  which  Y  and  S  are  mutually  independent 
with  Y  ~  N(0,  n-^a^Ik)  and  S  ~  Then  {or0<a<k  +  2, 


[I(eo,x(5),  (7^)  -  L{e^ZiiY,S),a')] 
=  Elink  +  2L)AL,a] 


(2.2.11) 


where 


AL.a  =  E 


\{\-V).i>,[V)  log{\-MV))\ 
\     {n-l)k  nk  +  2L  J 


(2.2.12) 


where  L  -  Poisson  (3^  ELi  and  V\L  ~  Beta  [\{k  +  2L),  i(n  -  l)k). 
Proof.       Let  T  =  n  ^JLj      ^^nd  G  =  T  +  5  so  that 


V 


w 


T/S 


1  +  W     1  +  T/S  G' 
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Then  eo,i(5)  =  GxI;,{V)  and  e^^,(r.5)  =  GV'a(V)  where  t*(V) 
i'aiV)  =  •  estimator  GV'(y)  of       one  has 


(n-l)fc 


and 


=    E,,,,,[Gtl;{V)-logG-log^{V)-l]  (2.2.13) 

If  we  write  t]  =  {2a^)~^  Ilf=i  then,  under  the  reparametrization  (&/cr,  1),  5  and 
T  are  independently  distributed  with  S  ~  X(n-i)fc  ^^"^  ^  ~  Xi(^)-  Introducing  the 
dummy  variable  L  ~  Poissonirj),  V  and  G  are  independent  conditional  on  L  with 
7|I  ~  Beta        +  2L),  i(n  -  l)fc)  and  G|L  ~  xlk+2L-  Then, 


G(l  -  V-)</^,(y) 
{n-l)k 


+  log{l  -  <i>,{V)) 


=  -£'«/. 


<T,1 


(nk + 2L) E 


nk  +  2L  J 


(n  -  l)fc 

The  next  theorem  provides  an  expression  for  the  risk  difference  of  eo,i{S)  and 
<,a.i(^.'?)  under  the  loss  (1.1.2). 

Theorem  2.2.5  For  Q  <  a  <  k  +  1  and  k  >  2, 


E,,,.[L(eoAS),a')  -  L{el,,,{Y ,  S),a')] 
=  E[{nk-l+2L')Al.J 


where 


E 


{n-l)k  nk-l+  2L*  ] 


L' 


(2.2.14) 


(2.2.15) 
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L'  ~  Pmsson  {^ZLii^^  -  ^)')  and  V\L*  ~  Beta        -  I  +  21*),  i(n  -  Ufc)  and 


k-a+l  ,  ,  (n-l)li 

^•^  ^  2  t;-i^(l-t;)^ 


The  proof  is  omitted  because  of  its  similarity  to  the  proof  of  the  previous  theorem. 

2.3    Interval  Estimation  of 

This  section  is  devoted  to  the  comparison  of  different  confidence  intervals  for  a- 
based  on  the  hierarchical  models  given  in  Section  2.2.  Using  5  alone,  the  minimum 
length  confidence  interval  for      with  confidence  coefficient  1  —  a  is  given  by 

Ca(5)  =  {(7^  :  ci5  <  a' <  C25},  (2.3.1) 

where  Ci  (  >  0  )  and  C2  (  >  ci  )  are  obtained  from 

/(n-i)fc+4  ^— j  =  /(n-i)fc+4  and  y^"'  /(„_i,fc(x)(ii  =  1  -  a,  (2.3.2) 

jv[x)  being  the  pdf  of  x^,  a  chi-sqaure  variable  with  v  degrees  of  freedom  (  Tate  and 
Klett,  1959;  Maatta  and  Casella,  1990  ).  On  the  other  hand,  the  shortest  unbiased 
confidence  interval  for  cr^  with  confidence  coefficient  1  —  a  is  given  by 

Ct{S)  =  {a' -.dxS  <a'' <dtS},  (2.3.3) 

where  c/j  (  >  0  )  and  ^2  (  >      )  are  obtained  from 

/(„_i)fc+2  (— j  =  /(„-i)fc+2  ( j-j  and         /(„_i)fc(a;)rfi  =  1  -  a.  (2.3.4) 

We  now  find  HPD  credible  intervals  for  (y~  under  the  hierarchical  priors  considered 
in  (I)  -  (III)  and  (I)'  -  (III)'.  We  shall  show  that  a  subclass  of  such  HPD  credible 
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intervals  have  smaller  length  as  well  as  greater  coverage  probability  (in  the  frequentist 
sense)  than  the  intervals  described  in  (2.3.1)  and  (2.3.3). 

Consider  first  the  class  of  priors  given  in  (I)  -  (III).  Also,  we  confine  attention  to 
intervals  of  the  form 

C3{W,  S)  =  {a^  :  ai<i>{W)S  <  a'  <  a2<i>{W)S},  (2.3.5) 

where  ai  (  >  0  )  and  02  (  >  ai  )  are  arbitrary  constants,  while  0  <  (j){W)  <  1.  Recall 
that      =  r/5  and  r  =  n  y;'- 

We  first  find  within  the  class  of  all  intervals  of  the  form  (2.3.5)  the  optimal 
<j>o{W)  which  leads  to  the  HPD  credible  interval  based  on  the  prior  given  in  (I)  - 
(III).  Specifically,  the  following  theorem  is  proved. 

Theorem  2.3.1  Consider  the  class  of  priors  given  in  (I)  -  (III)  with  {n  —  l)k  +  b  —  a  >  0 
and  6  <  k  +  2.  Then,  within  the  cleiss  of  intervads  of  the  form  (2.3.5),  the  optimal 
<i){w)  is  given  by  <f>o{w),  where  (i)o{w)  is  a  solution  of 

/(n-l)fc+6-a+4  {  T"}  ^        )  (   ^  ) 

=    /(„-i)fc+fc-a+4  (  — - — T — )  -Pfc-6+2  (—7 — ; — )  ,  (2.3.6) 
where  f^{x)  denotes  the  pdf  of  xl,  and  F^{x)  =  P{xl  <  x). 

Proof.  As  before,  write  R  =  {(7^)-\  Then  from  (I)  -  (III),  the  joint  posterior  pdf 
of  R  and  U  =  A/(n  +  A)  given  Y  =  y  and  5  =  s  is 

/(r, u\y,s)  oc  exp[-^r(3  +  ut)]r^^''''-''U^''''-''\  (2.3.7) 
Now  integrating  with  respect  to  u  in  (2.3.7),  one  gets 

f{r\y,s)    cc    eip(-ir5)r^«"-i)''+^-»»-ip(xL6+2  <  ^'•) 

=    expi-'^r3)M"-'^''^'-'^-'P{xl_,^,  <  wrs).  (2.3.8) 
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Next  we  need  to  show  that  f{r\y,s}  is  log-concave.  Since  exp{—^rs)r^^^"~^^'''^''~''^~^ 
is  log-concave,  it  suffices  to  show  that  P{xi-b+2  —  ^'')  log-concave.  This  fact  is  an 
immediate  consequence  of  Lemma  1.3.2. 

Since  the  posterior  pdf  of  R  given  Y  =  y  and  5  =  5  is  log-concave,  using  (2.3.8) 
the  HPD  credible  interval  for  Rs  within  the  class  of  all  intervals  of  the  form  (2.3.5) 
is  given  by  ( ,  )  where  ^w)  is  obtained  by  maximizing 

/J^  exp(-l^).^(("-^>''+^-»)-p(xU^3  <  ^^)dz  (2.3.9) 
with  respect  to  (i>{w).  This  leads  to  the  solution 

which  is  equivalent  to  (2.3.6)  since  /^(x)  denotes  the  pdf  of  xl-  This  proves  the 
theorem. 

Next  we  improve  on  confidence  intervals  of  the  form  (2.3.1)  and  (2.3.2)  in  a 
frequentist  sense  using  a  subclass  of  HPD  credible  intervals  described  in  (2.3.5). 
Denote  the  interval  given  in  (2.3.1)  and  (2.3.2)  as  Jml,  and  let  denote  an  interval 
of  the  form 

{cr^  :  ci(f>{w)s  <      <  C2<i>iw)s},  (2.3.11) 

where  Ci  and  are  determined  from  (2.3.2).  Also,  we  denote  by  L{I)  the  length  of 
an  interval  I. 

We  consider  the  special  case  b  =  a  {  <  k  +  2  ).  If  we  now  set  aj  =  Ci  and  03  =  cj, 
from  Theorem  2.3.1,  we  have  found  a  class  of  HPD  credible  intervals  of  the  form 

{a^  :  Ci<i>oiw)3  <  cr^  <  CiM^M  (2.3.12) 
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for  cr^  where  0o('w)  satisfies 

=  (— 1— (-^^  (2.3.13) 
We  now  prove  the  following  theorem. 

Theorem  2.3.2  For  an  interval  given  in  (2.3.12)  and  (2.3.13)  with  0  <  M^)  <  1, 
L{I^)  <  LUml)  ^nd  P^2{<t'  €  /^)  >  P.=  (cr=  G  Iml)  for  all  a\ 

Proof.  We  appeal  to  Theorem  3.1  of  Kubokawa  (1991)  in  the  special  case  of 
chi-squared  distribution.  By  definition  L{I^)  <  L{Iml)-  To  prove  that  /^^  hM  at 
least  as  large  a  confidence  coefficient  as  /ml,  following  Kubokawa  (1991),  we  need  to 
verify 

(i)  0o(if)  is  t  in  tu  and  lim  (i>o(w)  =  1, 

Ml— too 

First  we  prove  (i).  Making  u;  —  oo,  we  get  from  (2.3.13) 

/(n-l)fc+4  [-77.  r7~U~  I   ~  f{n-\)k+i  \  JT.  ,    ,  )  (2.3.14) 

\(lim^^oo 'Po(u'))ci/  V(lim^^oo '?f'o(tf))c2/ 

Since  /(n-i)fc+4(^)  =  /(n-i)fc+4(^),  and  /  is  a  chi-squared  pdf,  we  must  have 
lim  =  1. 

Ill— »oo 

Next  to  show  that  Mw)  is  t  in  w,  differentiate  both  sides  of  (2.3.13)  with  respect 
to  w,  and  get 

<^o{w){A  +  wB)  =  Mw)B,  (2.3.15) 
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where 


A  =  A{w)   =  -/('„_i)fe+4 


C2 


k-a+2 


1 


~~-^('n-l)fc+4 


0+2 


(2.3.16) 


and  B  =  B{w)   =    — /(„_i)fc+4  ( )  /fc-a+2  (  T~"  ) 

(  4>0Cl  )  '^'^  "'''^  (  (f>QCi  I 


 /(n-l)fc+4 


(2.3.17) 


To  show  that  S  >  0,  we  have  from  (2.3.13) 

^  =  (^)  (^) 

W     /fc-a+2  (5^)  IW  /fc-a 


to 

>  0 


(2.3.18) 


since  x/„(z)/F„(x)  is  J.  in  x  for  every  v  by  Lemma  1.3.3,  and  0  <  —  <  — .  Now, 


using  (2.3.16)  and  (2.3.13), 

^  —  /(n-l)fc+4 

X(^0 
>  0 


tw 


fc-o+2 


1     -^/n-lJM^U^)  1  ■^('n-l)fc+4(5;^) 


(n-l)fc+4V,j,0Ci 


^0C2  /(„-l)fc+4(^)        <^0Cl  /(n-l)fc+4(^) 


(2.3.19) 


since  x/^(x)/^(x)  =       -  2  -  xj  is  i  in  x  for  every  i/,  and  0  <  ^  <  ^.  It  follows 
from  (2.3.15)  -  (2.3.19)  that  4>'^{w)  >  0,  i.e.  <j)o{w)  is  T  in  w. 
Next  to  prove  (ii),  use  (2.3.13)  to  get 

I      \  -  I     w     \      _  /  1 


fin- 


l)fc+4 


/(n- 


ci<po(iy) 


Ci<i)o{w) 


-fin- 


l)fc+4 


,C2<^o(u;)^ 


to 


1 


l)fc+4 
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/(n-l)fc+4  {  —  ) 

>  0, 


Cl<poJ  \C2<P0  \C2<P0  /  \Ci<po 


putting  m  =  k  —  a-{-2{a<k  +  2),  n  =  fc,  y  =        and  x  =        in  Lemma  1.3.1. 
This  completes  tiie  proof  of  Theorem  2.3.2. 

To  improve  on  intervals  given  in  (2.3.3)  and  (2.3.4),  put  a,  =  di  (i  =  1,2)  and 
choose  b  =  a  —  2  and  a  <  ^-  +  4  in  (2.3.6)  to  get  <^o  from 

^'"-^"^^^  (^)  ^"-^  (^)  =  ^'"-^^''^^  [pk]  ^'""^^  (^)  ^'-^-'^^ 

Then  following  the  line  of  the  proof  of  Theorem  2.3.2,  one  can  show  that  L{I^)  < 
L{Isu)i  where  Isu  denotes  the  shortest  unbiased  confidence  interval  given  in  (2.3.3) 
and  (2.3.4),  and  P„z{a^  e  I<t„)  >  P„i{(T^  G  Isu)  for  all  c^^  where  (^o  is  defined  in 
(2.3.20). 

For  the  class  of  priors  defined  in  (I)'  -  (III)',  first  analogous  to  Theorem  2.3.1, 
define  a  class  of  confidence  intervals  of  the  form  (2.3.5),  where  is  a  solution  of 

1      \  „         f      w  \ 


/(n-l)fc+6-a+4  (   ^        )  -Pfc-fc+1  ( 

\(po{w)aiJ  \ 


1 


w 


=   /(„_i)fc+6-a+4   X7-T-   Fk-b+i  (2.3.21) 
\(po{w}a2j  \(pQ{w)a2) 

Such  intervals  are  HPD  within  the  class  of  intervals  given  in  (2.3.5)  under  the  prior 

(I)'  -  (III)'.  Now  putting  6  =  a  -  2  and  a  <  A;  +  3,  a.  =  Ci  (z  =  1,2),  (2.3.21)  reduces 

to 


1 


/(n-l)fc+2  I  I  ^fc-»+3  [~-]   =  f{n-l)k+2  (  —  I  ^fc-a^3  (  ^  )  (2.3.22) 


,  <P0Cl  /  \  (poCi  J  V  '^0C2  )  \  (p0C2  , 

The  resulting  interval  7^  satisfies  L{1^)  <  L{Isu)  and  P^zia^  G  7^)  >  P^2(a^  € 
Isu)  for  all  a-.  Thus,  we  have  produced  different  classes  of  HPD  intervals  which 
dommate  the  minimum  length  or  shortest  unbiased  confidence  intervals.  Unlike  the 
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intervals  of  Cohen  (1972)  and  Shorrock  (1990)  which  have  the  same  length,  our 
intervals  have  smaller  length  and  greater  coverage  probability.  The  intervals  proposed 
by  Goutis  and  Casella  (1991)  are  different  from  ours  although  they  achieve  the  same 
objective  as  ours. 

However,  it  is  also  possible  to  develop  HPD  intervals  using  the  formulation  of 
Cohen  (1972)  and  Shorrock  (1982,  1990).  Under  their  formulation,  the  minimum 
length  confidence  interval  for      with  confidence  coefficient  1  —  a  is  given  by 

C^{S)  =  {(7^  :  ai5  <      <  (ai  +  ci)5},  (2.3.23) 

where  ai  (  >  0  )  and  Ci  (  >  0  )  are  obtained  from 

/(n-i)fc+4  f— )  =  /(n-i)fc+4  {  ;  )  and   /"\     /(„_i)fc(x)£ii  =  1  -  a,  (2.3.24) 

/„(i)  being  the  pdf  of  x^,  a  chi-sqaure  variable  with  v  degrees  of  freedom  (  see  Maatta 
and  Casella,  1990  ).  On  the  other  hand,  the  shortest  unbiased  confidence  interval  for 
(7^  with  confidence  coefficient  1  —  a  is  given  by 

C^{S)  =  {<7^  :  a^S         <  (aj  +  €2)8},  (2.3.25) 

where  03  (  >  0  )  and  C2  (  >  0  )  are  obtained  from 

/(n-i)fc+2  (— )  =  /(n-i)fc+2  ( — \ — )  and  f(r,-i)k[x)dx  =  1  -  a.  (2.3.26) 

We  now  find  a  class  of  HPD  credible  intervals  for  <7"  under  the  hierarchical  priors 
considered  in  (I)  -  (III)  and  (I)'  -  (HI)'.  A  subclass  of  such  HPD  credible  inter- 
vals have  greater  coverage  probability  (in  the  frequentist  sense)  than  the  intervals 
described  in  (2.3.23)  and  (2.3.25). 

Consider  first  the  class  of  priors  given  in  (I)  -  (III).  Also,  we  confine  attention  to 
intervals  of  the  form 

Cs{W,S)  =  {a^  ■.o{W)S  <a^  <[^{W)  +  c)S},  (2.3.27) 
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where  c  (  >  0  )  is  an  arbitrary  constant,  while  0  <  <p{W)  <  1. 

The  optimal  MW)  within  the  class  of  all  intervals  of  the  form  (2.3.27)  which 
leads  to  the  HPD  credible  interval  based  on  the  prior  given  in  (I)  -  (III)  is  obtained 
from  the  following  theorem. 

Theorem  2.3.3  Consider  the  class  of  priors  given  in  (I)  -  (III)  with  {n—l)k  +  b  —  a  >  0 
and  b  <  k.  Then,  within  the  class  of  intervals  of  the  form  (2.3.27),  the  optimal  4>{w) 
is  given  by  <f)o{w),  where  <i>o{'w)  is  a  solution  of 

f(n-l)k+b-a+4  (   T  )  Fk-b+2  (  "T-;  \] 

=     /(n-l)fc-H>-a+4  (    ,    ,     s    ,       )  Fk-b+2  \    ,  ,     s    ,       )  »  (2.3.28) 

where  /„(x)  denotes  the  pdf  of  xl,  and  F„{x)  =  P{xl  <  x). 

The  proof  of  this  theorem  is  omitted  due  to  its  similarity  to  the  proof  of  Theorem 
2.3.1. 

Next  we  improve  on  confidence  intervals  of  the  form  (2.3.23)  and  (2.3.24)  in 
a  frequentist  sense  using  a  subclass  of  HPD  credible  intervals  described  in  (2.3.28). 
Denote  the  interval  given  in  (2.3.23)  and  (2.3.24)  as  /ml,  and  let  denote  an  interval 
of  the  form 

{(7^  :  <i>{w)s  <cr^  <  {(f>{w)  +  Ci)s},  (2.3.29) 

where  Ci  is  determined  from  (2.3.24). 

For  the  special  case  6  =  a  (  <  -f-  2  ),  we  have,  from  Theorem  2.3.3.  a  class  of 
HPD  credible  intervals  of  the  form 

{(T^  :  Mw)s  <<^^  <  (M^)  +  ci)^}  (2.3.30) 
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for      where  0o{w)  satisfies 

1     \  „         (  w 


=     /(n-l)fc+4  ( J-4— )  f-— ^5^)  (2.3.31) 


We  now  prove  the  following  theorem. 

Theorem  2.3.4  For  an  intervad  given  in  (2.3.30)  and  (2.3.31 )  with  0  <  (poiw)  <  a^, 
L{I^)  =  L{Iml)  and  P,2{a^  G  I<t>o)  >  PA(^^  €  Iml)  for  all  a\ 

Proof.  We  again  appeal  to  Theorem  3.1  of  Kubokawa  (1991)  in  the  special  case 
of  chi-squared  distribution.  By  definition  L^I^,^)  =  L{Iml)-  To  prove  that  /^^  has  at 
leMt  as  large  a  confidence  coefficient  as  ImLi  following  Kubokawa  (1991),  we  need  to 
verify 

(i)  (i>o{w)  is  t  in  10  and  lim  (i)o{w)  =  ai, 

(ii)  /(„-i)fc+4  (^)  Fk  (^)  >  /(„-i)fc+4  (^(J,+cJ  (*o(w)+cJ- 

First  we  prove  (i).  Making  tx;  -+  oo,  we  get  from  (2.3.31) 

^'--'^'^^  (lim._l  ^o(t.))  =  (lim._(M-)  +  c,))  (2.3.32) 

Since  /(„_i)fc+4(^)  =  /(„_i)fc+4(^^^),  and  /  is  a  chi-squared  pdf,  we  must  have 
lim  (hoiw)  =  ai. 

tu— »oo 

To  show  that  (i>o{w)  is  t  in  ly,  consider  Wi  and  7i;2  such  that  0  <  <  103.  Here. 
(?i>o(ti'i)  and  4>q(w2),  respectively,  maximize 


and 


/*^7'  /("-l)fc(^)-^fc-a+2(iyiZ)<ii 


/(„_l)fc(2:)-F'fc-a+2(«'22)<i-Z. 
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To  show  that  (Pq{wi)  <  (i)oiw2),  use  Lemma  1.3.4  and  Lemma  3.1  of  Shorrock  (1990), 
which  is  stated  here. 

Lemma  2.3.1  Let  f{x)  and  g{x)  be  two  unimodal  densities  and  let  <f>  =  (pf  maximize 
j^'^"  f{x)dx  and  =  <pg  maximize  J^'^"  g{x)dx.  Then,  if  f{x)^g(x)  is  an  increasing 
function  of  x,  (pf  >  <f)g. 


/(n-l)fc+4  (  TT  T  1  -^fc  \  ~ri  '\  \  ~  f{n-l)k-\-4 


Next  to  prove  (ii),  use  (2.3.31)  to  get 

w 

<i>o)         "'^^  \4>Q  +  Ci  J  ''\(i>0  +  ClJ     "  V*^0 


/(n-l)fc+4  (^) 

>  0, 


putting  m  =  k  -  a  +  2  {a  <  k  +  2),  n  =  k,  y  =  ^  and  x  =         in  Lemma  1.3.1. 

This  completes  the  proof  of  Theorem  2.3.4.  Also,  we  may  note  that  the  class  of 
intervals  given  in  (2.3.30)  includes  as  its  members  the  ones  considered  by  Brewster 
and  Zidek  (1974)  and  Shorrock  (1990)  for  the  special  case  a  =  b  =  2. 

To  improve  on  intervals  given  in  (2.3.25)  and  (2.3.26),  choose  b  =  a  -  2  and 
a  <  /:  +  4  in  (2.3.28)  to  get  (^o  from 

A-'-'"  (^)  =  (^)  (^)  (2-3-33) 

Then  following  the  line  of  the  proof  of  Theorem  2.3.4,  one  can  show  that  P„2{a^  e 
I<h)  >  ■P<t2(o-^  e  Isu)  for  all  a^,  where  4>q  is  defined  in  (2.3.31)  and  Isu  denotes  the 
shortest  unbiased  confidence  interval  given  in  (2.3.25)  and  (2.3.26). 

For  the  class  of  priors  defined  in  (I)'  -  (III)',  first  analogous  to  Theorem  2.3.3, 
define  a  class  of  confidence  intervals  of  the  form  (2.3.27),  where  (po{w)  is  a  solution 
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of 

/(n-i)fc+fe-a+4  (— - — rj  i -—, — :  ) 

=     /(„-l)fc+6-a+4  [-7-,  T-          )  (   T"^         )  (2.3.34) 

Such  intervals  are  HPD  within  the  class  of  intervals  given  in  (2.3.27)  under  the  prior 
(I)'  -  (III)'.  Now  putting  6  =  a  -  2  and  a  <    +  3,  (2.3.34)  reduces  to 

^^"-^'""^  {£)  ^''-""^  (^)  =  (^)  ^'-"'^  (^) 

The  resulting  interval       satisfies  P„2{a^  6  /^)  >  P^^icr^  G  hu)  for  all  a^. 


2.4    Numerical  Results 


Tables  2.1  and  2.2  provide  the  numerical  computations  of  the  risk  improvement 
°f  ^a,a,iO^^^)  over  the  best  multiple  estimator  S/{n  -  l)k  of  given  by  (2.2.11) 
and  (2.2.12)  for  k  =  5,10,  n  =  2,  3,  a  =  1, 2,  3  along  with  several  sets  of  values  of 
9i,  1  =  1,...,/:.  Tables  2.3  and  2.4  provide  the  risk  improvement  of  e* ^(y ,  5)  over 
the  best  multiple  estimator  S/{n-  l)k  of  given  by  (2.2.14)  and  (2.2.15)  for  the 
same  values  of  k,  n,  a  and  i  =  l,...,k.  The  numerical  computations  were  done 
using  the  mathematical  software  Mathematica  (Wolfram,  1988). 

It  follows  from  these  tables  that  risk  improvement  of  e^f^{Y,  S)  over  S/{n-l)k 
can  often  be  quite  substantial  even  for  k  as  small  as  5.  For  fixed  k  and  a,  the  risk 
improvement  seems  to  be  decreasing  in  n.  It  is  our  conjecture  that  the  subclass  of 
proposed  HB  estimators  where  2  <  a  <  A;  +  2  is  an  admissible  class  of  estimators  for 
the  variance  under  entropy  loss. 

The  percent  risk  improvement  of  e^^^.^{Y,S)  over  S/{n-l)k  is  plotted  agamst 
the  non-centrality  parameter  given  by  77  =  uiEtiS,  / 2a^  for  k  =  5,  n  =  2,  and 
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k  =  10,  n  =  2  in  Figures  2.1  and  2.2,  respectively.  The  three  lines  represent  the 
percent  risk  improvement  for  a  =  1,2,3. 

To  illustrate  the  results  for  the  HPD  credible  intervals  for  a',  random  samples 
were  generated  for  a  fixed-effects  balanced  one-way  ANOVA  model  with  k  =  .5,  n  =  2, 
and  =  1,  and  two  sets  of  values  of  Oi.  The  values  of  ci  and  C2  given  by  (2.3.1) 
and  (2.3.2)  were  first  obtained  with  confidence  coefficient  1  -  a  =  0.95.  Then  <pQ 
was  obtained  using  the  expression  given  in  (2.3.13).  This  was  repeated  for  2000 
replications  for  each  combination  of  k,  n,  a,  and  ©.  Table  2.5  gives  the  probability 
of  coverage  and  Table  2.6  gives  the  average  (po  from  each  of  these  2000  replications. 
The  value  of  (i>o  is  the  ratio  of  the  length  of  the  HPD  credible  interval  to  the 
length  of  the  minimum  length  credible  interval  Iml-  Although  the  improvement  in 
the  coverage  probability  is  not  very  large,  a  fairly  large  reduction  in  the  length  is 
obtained. 
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Table  2.1.  Percent  Risk  Improvement  of  ejji(y,  S)  over  S/{n  -  l)k  for  k  =  .5 


n  =  2 

n  =  3 

a=l        a  =  2       a  =  3 

a  =  I       a  =  2     a  =  3 

^.  =  0  V I 
=  1  V I 
^.  =  i  Vt 
di  =  2t-l  V  t 

-6.6888     0.0000  5.1374 
10.0212    11.6014  11.9838 
14.5122    13.1113  11.1324 
0.7027     0.5051  0.3391 

-6.3054  0.0000  4.3272 
8.8562    9.6882  9.3339 

10.9909  9.1853  7.1180 
0.0317    0.0190  0.0104 

Table  2.2.  Percent  Risk  Improvement  of  ^aaiO^  ■>  ^)  over  S/{n-l)k  for  k  =  10 


n  =  2 

n  =  3 

a  =  l        a  =  2       a  =  3 

a=l        a =2  a=3 

^.  =  0  Vi 
^.  =  1  Vi 

d.  =  \  vt 

^.  =  2z  -  1  Vz 

-5.4041     0.0000  4.4901 
15.3777    15.3715  14.8772 
1.0460     0.8098  0.6095 
0.0000     0.0000  0.0000 

-4.7753     0.0000  3.7204 
11.8702    11.3846  10.5213 
0.0915     0.0623  0.0409 
0.0000     0.0000  0.0000 

Table  2.3.  Percent  Risk  Improvement  of  e*^  ^(y ,  S)  over  S/{n-  l)k  for  k  =  b 


n  =  2 

n  =  3 

a  =  l       a  =  2     a  =  3 

a  =  l       a  =  2     a  =  3 

^,  =  0  V I 

ei  =  i  V  t 
(?<  =  i  vi 

=  2i  -  1  Vi 

-7.2112    0.0000  5.0982 
-7.2112    0.0000  5.0982 
4.0271    7.5384  9.2252 
6.1421    4.5709  3.0861 

-6.4158    0.0000  3.9909 
-6.4158    0.0000  3.9909 
3.4787    6.1658  7.0088 
2.2548    1.4616  0.8415 

Table  2.4.  Percent  Risk  Improvement  of  e;  „  ^(y,  5)  over  S/{n  -  l)k  for  k  =  10 


n  =  2 

n  =  3 

a  =  1       a  =  2       a  =  3 

a  =  I       a  =  2     a  =  3 

=  0  V I 

ei  =  i  V  z 

=  2z  -  1  V  i 

-5.5902     0.0000  4.5481 
-5.5902     0.0000  4.5481 
13.4105    11.9067  10.2688 
0.0077     0.0054  0.0037 

-4.8062    0.0000  3.6538 
-4.8062    0.0000  3.6538 
8.1890    6.8467  5.0832 
0.0000    0.0000  0.0000 
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Table  2.5.  Probability  of  coverage  for  /^i,,  when     =  5,  n  =  2  and  1  —  a  =  0.95. 


n  =  2 

a  =  I     a  =  2     a  =  3 

=  1  Vi 
^,  =  i  Vz 

0.9500    0.9500  0.9500 
0.9510    0.9510  0.9510 

Table  2.6.  Average  value  of  (^o  for       when  ^-  =  5,  n  =  2  cind  I  —  a  —  0.95. 


n  =  2 

a  =  I     a  =  2     a  =  3 

0.8897    0.9138  0.9353 
0.9596    0.9702  0.9796 

CHAPTER  3 

ESTIMATION  OF  THE  REGRESSION  VARIANCE 


3.1  Introduction 

In  this  chapter,  we  address  the  problem  of  estimation  of  a^°-  (a  >  0),  where 
(7-  is  the  error  variance  in  a  Hnear  regression  model  and  introduce  two  classes  of 
hierarchical  Bayes  (HB)  estimators  for  the  two  nested  regression  models, 

r  =  Xi/3i  +  X2/32  +  €  (3.1.1) 

and 

r  =  Xi/3i+6.  (3.1.2) 

These  HB  estimators  dominate  the  best  multiple  estimator  of  5"  (5  being  the  error 
sum  of  squares),  eo,a(5)  =  c^S^^  where 

^  _.-.r(|(n-p  +  2a)) 
r(i(n-p  +  4a))' 

under  the  relative  squared  error  loss 

I(a,a^-)  =  (a/a2--l)'.  (3.1.3) 

Although  the  above  loss  is  considered  in  this  chapter,  similar  results  can  be  obtained 
by  using  the  entropy  loss  given  in  (1.1.2).  Denote  0^  =  (/3f,/3[)  and  X  =  (Xi.Xj) 
where  and  /Sj  are  vectors  of  pi  and  unknown  parameters  and  X  is  the  known 
design  matrix  of  order  n  x  p.  Within  the  class  of  estimators  equivariant  under  location- 
scale  transformations,  the  best  multiple  estimator  eo,a(5)  is  admissible,  but  within 
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the  larger  class  of  estimators  equivariant  only  under  a  transformation  of  scale.  eo,a(5) 
is  no  longer  admissible. 

A  class  of  hierarchical  Bayes  priors  is  considered  for  each  of  the  two  regression 
models.  The  class  considered  for  the  full  model  is  similar  to  the  class  of  priors 
given  in  (I)  -  (III)  of  Section  2.2.  Gelfand  and  Dey  (1988a)  provided  Stein-type 
estimators  of  dominating  the  best  multiple  estimator  eo,i(5).  The  best  multiple 
estimator  can  be  derived  as  the  Bayes  estimator  of  under  the  reference  prior 
7r(/9i, . . .  ,/3p,  (7^)  a"^  as  given  in  Berger  and  Bernardo  (1991).  For  the  reduced 
model,  Ghosh,  Lee  and  Littell  (1990)  introduced  a  modified  version  of  the  priors 
considered  here  for  the  estimation  of  the  response  function. 

In  Section  3.2,  we  develop  a  class  of  HB  estimators  for  <j^°'  for  the  full  model.  It  is 
shown  that  a  subclass  of  these  HB  estimators  dominate  the  best  multiple  estimator. 
In  Section  3.3,  we  do  the  same  for  the  reduced  model.  In  Section  3.4,  we  provide  two 
claisses  of  HPD  credible  intervals  for  cr^,  which  have  shorter  length  and  higher  coverage 
probability  than  the  minimum  length  and  shortest  unbiased  confidence  intervals  based 
on  S  only. 

3.2    HB  Estimator  for  the  Full  Model 

Assuming  X^X  to  be  non-singular,  let  us  denote  the  least  squares  estimator  of 
/3  based  on  the  full  model  as 

^=(X^Xy'  X^Y, 
and  the  least  squares  estimator  of  (3^  under  the  reduced  model  as 


^,  =  {xJXry'x^Y. 
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The  hierarchical  model  is  given  in  (I)  -  (III)  below,  where  R  =  rr'^. 

(I)  Conditional  on  B  =  f3,  R  =  r  and  A  =  A.  Y"  is  distributed  as  Y"  ~ 
N{XI3,  r-iJ„); 

(II)  Conditional  on     =  r  and  A  =  A,  /3  ~  iV(0,  (Ar)-i(X''X)-^); 

(III)  R  and  A  are  marginally  independently  distributed,  with  fnir)  oc  r":" 
and  /a(A)  oc  A-5''(1  +  Xy^^'^-^l 

Writing  U  =         we  have  fuiu)  oc  u":''.  Then, 

(i)  conditional  on  f3,  S  =  s,  R  =  r  and  U  =  u, 

(3^N  ((1  -  u))3,  r-^l  -  u)  (X^X)-')  ; 

(ii)  conditional  on  /3,  5  =  3  ajid  U  =  u, 

R  ~  Gamma      (5  +  ut) ,  ^(n  —  a  +  2)^ 
where  t=^{X^X)0- 

(iii)  conditional  on  /3  and  5  =  3  the  pdf  of  U  is  given  by 

fu{u\y)   oc   uJ(''-''>(l +uti;)-^("-"+''/(o,i)(u). 

(3.2.1) 

Now,  under  the  loss  (3.1.3),  the  Bayes  estimator  of  R"  =  a'^"  is  given  by 
EiR"  \P,3)/E{R^''  \^,s).  From(i)-  (iii),  we  have 

r(i(n-o  +  2a  +  2))  , 

I  ^'  ^)  =  -7777  —TT^E  (5  +  UT)-- 1/3,  5 

r(i(n-a  +  2)j         ^  J 
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and 


EiR""  1/3,5)  = 


r|(n-a  +  4a  +  2) 
'ri(n-a  +  2) 


£;  (5  +  ar)-'"|/3,5 


Under  the  loss  (3.1.3),  the  hierarchical  Bayes  estimator  of  cr^"  is  given  by 

E{R-°'\^,S) 


'a,h, 


E{R-^<^\^,S) 


ri(n  -  a  +  2a  +  2)  ^  [(1  +  ^W)-"|3,  5 


2  7    ri(n-a  +  4a  +  2)£[(l  +  tW)-2«|^,5 


=     5"°  0a,6,a(u'), 


say, 


(3.2.2) 


where  w  =  t/s. 

The  following  version  of  Theorem  2.1  of  Kubokawa  (1991)  provides  sufficient  con- 
ditions under  which  e^^^(3,5')  dominates  the  best  equivariant  estimator  eo^a{S)  = 
c^S"  of  a^"  under  the  loss  (3.1.3). 

Theorem  3.2.1  Suppose 

ti;— ♦OO 


(b)  <f>a,b,a{^)   >   ^o,a(u'),  wiierc 

<^o.a(ty)  = 


C      exp(-i3).5(-p)-i(/-  exp{-h)yl-'dy)d3 


Tien  e^^g,(y3,  5)  dominates  eo(5)  under  the  loss  (3.1.3),  i.e., 


for  all  (3,  a\ 
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Remark.  The  above  inadmissibility  result  extends  to  any  quadratic  loss 
lQ(a,  C7^°)  =  Q{a'°'){a  -  a'")',  Qia'")  >  0. 
Note  that  <f)o,a{w)  can  be  simplified  further  as 


=  2 


/o~  /o"  exp(-i3(l  +  x))xf-^55("+'"^>-Mx<i3 


=   'f>2.2,cM  (using  (3.2.1)  and  (3.2.2)).  (3.2.3) 

We  will  again  restrict  our  attention  to  the  subclass  ^aaai^i  done 
in  Section  2.2.  The  next  theorem  provides  conditions  on  a  under  which  <f>a,a,a(^) 
satisfies  the  conditions  of  Theorem  3.2.1. 

Theorem  3.2.2  Let  2  <  a  <  p  +  2.  Then,  4>a,a,a{'w)  sa.tis£es  conditions  (a)  and  (b)  of 
Theorem  3.2.1. 

Proof.  We  first  prove  condition  (b).  For  every  2  <  a  <  p  +  2,  using  (3.2.1),  we 
simpHfy  (l>a,a,a{yj)  given  in  (3.2.2)  as 

(w)    =   .-g  ri(n  -  g  +  2a  +  2)     u'^jl  +  uti;)-i^"-°+^°^^)c/u 
ri(n  -  a  +  4a  +  2)     u^{l  +  uti;)-^<"-»+''«+2)^^ 

^         ri(n  -  g  +  2a  +  2)  f^'^'^-^  z^jl  -  z)^("-P^^-^)dn 

ri(n  -  g  +  4a  +  2)  ^^(l  _  ,)i(n-p^4a-2,^„  ''^•^•^^ 


(_                                   UW  \ 
Puttmg  z  =  —  
1  +  UW  J 
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ri(n- 

a  +  2a  +  2)P\Beta{^, 

ri(n- 

a  +  ia  +  2)  p  '^Betai^^, 

n^£±^)  <  w/{l+w) 

B 


=  2- 


-p  +  2a)P\Beta{^, 

nz£±l2.)  <  w/{l+w) 

-  p  +  4a) p 

Betai^^, 

(3.2.5) 


p 

[Betal^ 

-a+2 

2  ' 

n—p+2a \ 

2  ' 

<  w/{l  +  u;) 

p 

[Beta(^ 

-a+2 

2  ' 

n-p+ia \ 

2  ' 

<  w/{l  +w)] 

Se<a(f,^i=^)  <w/{l+w) 


Beta{l^^^^)<w/{l+w) 


(3.2.6) 


-  2'  2 

Now,  using  Lemma  2.1.1,  (^0,0,0(1^)  satisfies  part  (b)  of  Theorem  3.2.1. 

The  arguments  required  to  show  that  4>a,a,a{w)  satisfies  part  (a)  of  Theorem  3.2.1 
are  omitted  because  of  their  similarity  to  the  arguments  at  the  end  of  the  proof  of 
Theorem  2.2.2. 


Exact  Risk  When  a  =  1 


We  will  now  obtain  an  exact  expression  for  the  risk  of  e^^i(/3,  5)forO<a<p  +  2 
under  the  loss  (3.1.3).  Note  that  for  a  =  1, 


=  2/(n-p  +  2) 


(3.2.7) 


Now,  using  (3.2.2)  and  (3.2.4),  and  writing  V  =  W / [1  +  W),  we  simplify  the 
expression  for  e^^^(/3,  S)  as 

5   /o''2^(l-z)5("-'')Ju 


Ci(/3,5)  = 


"  -      +  4  /J"  2^(1  -  z)5("-P+2)du 
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n  —  a  +  4 


1  + 


n  —  a  +  4 


^  ^  n  -  p  +  2  ^  n  -p  +  2   /J'  z^(l  -  zjs^^-P+^j^^ 


n  —  p  +  2 


where 


2  yi(p-a+2)(^  _  y^i(n-p+2)^^ 


(3.2.8) 


(3.2.9) 


n  -  a  +  4   />•  2^(1  _  z)j("-P+2)d^, 

The  following  theorem  provides  an  exact  expression  for  the  risk  difference  of 
eo,i(5)  and  e^^i(/3,5)  under  the  loss  (3.1.3). 

Theorem  3.2.3  Consider  the  model  under  which  Y  and  S  are  mutually  independent 
with  Y  ~  N{Xp,  a^I„)  and  5  ~  cr^xl-p-  Then  for  0  <  a  <  p  +  2, 


=  E[{n  +  2L){n  +  2L +  2)AL,a] 


(3.2.10) 


wiiere 


Al,.   =  E 


(2I  +  a-2)(^„(y)(l  -  7)  f        2  \~V  1 

(n-a  +  4)(n-p  +  2)    \n  +  2  +  2L     n-p  +  2j 


(3.2.11) 


L  ~  Poisson  {^(3^X^X(3)  and  V\L  ~  Beta  [\{p  +  2L),  i(n  -  p)) , 


Proof.       Let  T  =  (3'^X^X/3  and  G  =  T  +  S  so  that 


V  = 


l  +  W     l  +  T/S  G' 


45 


Then  eo,i(5)  =  GMV)  and  e^,^i(/3,5)  =  GM^)  ^here  u'.(V) 
^^i^V)  =  '^~!^}jXat,i^^ ■  For  an  estimator  Gi^{V)  of  a",  one  has 

=  [(Gt/.(K)  -  If  ] 


^  and 

n-p+2 


(3.2.12) 


If  we  write  77  =  (2(7^)"^  (5^X'^Xf3  then,  under  the  reparametrization  (/3/cr,  1),  5 
and  r  are  independently  distributed  with  S  ~  Xn-p  ^-nd  T  ~  Xp(^)-  If  ^^e  dummy 
variable  L  ~  Poisson{T]),  then  V  and  G  are  independent  conditional  on  L  with 
V\L  ~  Beta  (i(p  +  2L),  i(n  -  p))  and  G\L  ~  x^+2L- 
Then, 

Efi,a,i  [(Gt/.(V)  -  1)'  |]  =  E  Ep,,,,  \{Gm)  -  1)' 


=  E 


(n  +  2I)(n  +  21  +  2)  £;[V»2(y)|L]  -  2(n  +  2L)E[xi^{V)\L]  +  1 


=  E  |(n  +  2I)(n  +  21  +  2)  E  {  (^(K)  -  — J^)' |l)  +  - 


+  2L  +  2 

(3.2.13) 


Then,  we  evaluate 


E 


2  1 
L 


=  E 


n  +  2  +  2L 


{MV)-UV)} 


E 


(n  -  p  +  2)2 


[2UV)  -  <^^(K) 


2(1 -F) 


(n-p  +  2)(n  +  2  +  2I) 


L  . 

(3.2.14) 
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Differentiating  (f>a{V)  from  (3.2.9)  with  respect  to  V,  we  have 


p  —  a  +  2     n  —  p  +  2 
2V  2(1  -  V) 


MV)- 


n  —  a  +  4 
2V 


<f>liV) 


(n-a  +  4)(l  -V) 


n  —  a  +  4 


(3.2.15) 


Substitution  of  (3.2.15)  into  the  r.h.s.  of  (3.2.14)  yields 


(n-p  +  2)^ 


(n-a  +  4)(l  -  V) 


n  —  a  +  4 


2(1  -  V) 


(n  -  p  +  2)(n  +  2  +  21) 
Integration  by  parts  gives 


(3.2.16) 


E  {V{1  -  vy  <f>:{V)\L] 

=  /;v(i-v)^.-^-(i-.)V-v».iV^(^, 


(P  -\-2L    n  —  p' 


2     '  2 


[u  =   (1-u)    =  (?ia(t^) 


v=l 


u=0 


1  fp  +  2L  £±2i_i, 


2  ^        ^  2 


P-Ht  n-p  "I 

V  2   (1  -  u)  2   I  (?!>a(y)(fT; 


(3.2.17) 
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Substitution  of  (3.2.17)  into  (3.2.16)  yields 


E 


(1  -  V)ct>,{V)  /^^^  _y^_  p-a  +  2-{n-a  +  A)V  ^  (n-p  +  2)^^ 


(n-p  +  2)2 


(n  -  a  +  4) 


n  —  a  +  4 


P  +  2£,   _    ,     2(n-p  +  2)l 
2     ^         ^     (n  +  2  +  2L)J 


=  E 


{\-V)UV)  \^_2{v-a  +  2) 


(n-p  +  2)2 


n-a  +  4       (n -p  +  2)(rH- 2  +  2L) 


n-a  +  4^         '     n-a  +  V  '] 


P  +  2L 


=  E 


(1  -  V)<i>a{V)  r       2  2  (2i:  +  a-2)(l-7)  I 

(n-p  +  2)    \n-a  +  4      (71  +  2  +  21)      (n  -  p  +  2)(n  -  a  +  4)  J 


=  E 


{2L  +  a  -  2){l  -  V)<f>^{V)  (         2  1-V  \ 

(n-p  +  2)(n-a  +  4)     \(n  +  2  +  2I)      n-p  +  2j 


=  A 


L,a- 


(3.2.18) 


The  theorem  follows  now  from  (3.2.13),  (3.2.14)  and  (3.2.18). 

Remark.  It  is  proved  in  Ghosh  (1992b)  that  i4o,2  =  0,  Ai,2  >  0  for  alU  =  1, 2, . . .,  and 
Aj,a  >  0  for  all  a  6  (2,p  +  2),  while  Ao.a  <  0,  for  all  0  <  a  <  2.  So,  when  a  =  2  and 
7/  =  0,  there  is  no  risk  improvement  over  the  best  multiple  estimator  of  the  sample 
variance. 
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3.3    HB  Estimator  for  the  Reduced  Model 

We  consider  the  following  class  of  hierarchical  priors  for  the  reduced  model. 
(I)'  Conditional  on  /3,  i/i,  iZ  =  r  and  A  =  A,  F  is  distributed  as  Y"  ~ 

(II)'  conditional  on  i2  =  r,  Ui  and  A  =  A, 


/3~  iV 


\  0  / 


,  (Ar)-^(X^X)-^ 


(III)'  i/i,  R  and  A  are  marginally  independent,  with  Ui  ~  Uniform{W^), 
fR{r)  oc  r-5»  and  /a(A)  oc  A-5*(1  +  A)-5<*-''). 


Writing  U 


c  =  X^X  = 


/  \ 

C21  C22 


we  have  the  following  theorem. 


Theorem  3.3.1  Under  the  model  given  in  (I)'  -  (III)', 
(if  conditional  on  Y  =  y,  R  =  r  and  U  =  u, 


(3^  N 


(1  -  u)/3i  +  u/3i 


(^21   C22  +  iz;i^22.i 


(3.3.1; 


where  C22.1  =  C22  -  C21C1/C12; 


(iif  conditional  on  Y  =  y  and  U  =  u, 

R  ~  Gamma  ^^(5  +  ut),  ^(n  -  pi  —  a  +  2)j 


where  t  =  ^2  ^■i2.\^2 


(iiif  conditional  on  Y  =  y  the  pdf  of  U  is  given  by 

fu{u\y)   ~    u5('^-'''(l +itti;)-^("-'''-''+'>/(o,i)(") 
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(3.3.2) 


where  w  =  t/s. 


Proof.       The  joint  pdf  of  Y,  (3,  Ui,  R  and  A  is  given  by 


f{y,f3,t^i,r,X)  oc  r"  exp 


-l\\y-X(3\\' 


x(Ar)2  exp 


-~  (/3i  -  ^if  Cn  {f3r  -  i/i)  +  /3^C22/32  +  2  {(3,-  u,f  €,2^2 


X  r-t  A-5*(l  +  A)-5(^-*). 


(3.3.3) 


Write 


and 


(3.3.4) 


(^1  -  u^f  Cii  (/3i  -  «/i)  +  /3[C22/32  +  2  (/3i  -  i/i)^ 


(3.3.5) 


Integrating  with  respect  to  Ui  in  (3.3.3),  and  using  (3.3.4)  and  (3.3.5),  one  gets 


/(y,/3,i/i,r,  A)    oc  exp 


—    r2(Ar)2  exp 


(3.3.6) 


Next  observe  that 


y  C21   C22  y 


/  -  \ 
/3: 


/      r  ^ 


and  Cn/3i  =  Xfy, 


so  that 


(3.3.7) 


and,  hence, 


(3.3.8) 
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Then,  substituting  (3.3.7)  and  (3.3.8)  in  the  exponent  from  (3.3.6),  we  have 
(^-(3fc(0-(3)  +  Xf3^C22.if32 


+2  (3:  -  f3^Y  Cu  (^2  -  /32)  +  A/3^C22.i/32 


'^x  -  (/3i  +  C:,'C,202)  +  Cn'C,2  {/32  -i^j))' 


X  Cn  [/3i  -  {(3,  +  C-^Ci2/3j  +  Cr/Ci2  (/32  -  ^2)] 


+2 


+  {(^2  -  '02^  C22  (/32  -  '^2)  +  A/3[C22.l/32 


=     (/3i  -  rjf  Cu  (^1  -  T?)  +  {^2  -  (32Y  C22.X  (^2  -  (^2)  +  A/3[C22.l/32 


where  r?  =/3i  +  C-'Ci2/92 


-232^22.1/92  +  (1  +  A)/3^C22.l/32 
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1  +  A 
Therefore, 


A     -T  ~ 

P2  C^22.lP2 


(3.3.9) 


f{y,rj,P„r,X)   a   r=T^(Ar)f  A-^^(l  +  A)-^^^-'') 


xexp 


(3.3.10) 


and,  hence, 
/ 


(3,  +  C-,'Cuf32 

/32 


AT 


0 


°  TTa^22.1 


/J 


(3.3.11) 

Using  (3.3.8),  one  gets  (i)'  from  (3.3.11).  Next  integrating  (3.3.10)  with  respect 
to  T]  and  (32,  one  has 


/(y,r.  A)    a  exp 


A 


1  +  A 


(1  +  A) 


-2 


Put  u  =  A/(1  +  A).  Then,  du  =  {1  +  Xy^dX.  Therefore, 


/(y.^'iA)   a  exp 


f  {  ^        A    -  r 


(3.3.12) 


(3.3.13) 
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and. 


/(y,r,A)    oc  +  uw)-^^"'"'"'^''^ 

where  w  =  t/s.  This  gives  us  the  theorem. 

Therefore,  under  the  loss  (3.1.3),  the  HB  estimator  of  cr^°  is  given  by 

E{R°\Y) 


(3.3.14) 


E{R^°\Y) 


\2J  TUn 


ri(n -p^-a  +  2a  +  2)  E[{1  +  UWr^^Y] 


(n  -  pi  -  a  +  4a  +  2)  E  [(1  +  UWy^^lY] 


=   S°  <f>lk,a{^)^  say. 


(3.3.15) 


We  will  again  restrict  our  attention  to  the  subclass  e*^_„(y).  Using  arguments 
similar  to  those  in  Theorems  3.2.1  and  3.2.2,  we  now  have  the  following  theorem. 

Tiieorem  3.3.2  Under  the  loss  (3.1.3),  elaaO^)  dominates  €q  ^(S)  for  every  a  € 
[2,P2  +  2). 

Note  that  e*  „  i(Y')  can  be  simplified  further  as 

S  u^(l  +  uu;)-5("-'''-»+''>(iu 


\  ^  /o^z^(l-z)^("-P)dz 

/o^2'^(l-z)5("-''+^'(iz 


n  —  pi  -  a  +  4 


n  —  pi  —  a  +  4 


1  + 


P2  -  a  +  2 
n  -  p  +  2 
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n  —  p  +  2 


(3.3.16) 


where 


yi(P2-<'+2)(l  _  \/)5{n-P+2) 


(3.3.17) 


The  following  theorem  provides  an  exact  expression  for  the  risk  of  e^^^^iY)  for 
o  €  [0,p2  +  2)  under  the  loss  (3.1.3). 

Tiieorem  3.3.3  Consider  the  model  under  which  Y  and  S  are  mutually  independent 
with  Y  ~  N{X/3,  a^In)  and  S  ~  (t^I.^.  Then  forO  <a<p2  +  2, 

,2 


=  E[{n  +  2L'){n  +  2L' +2)Al.  J 


(3.3.18) 


where 


{2L' +  a  -  2)<l>l{V){l  -  V)  f        2  l-V 

(n-a  +  4)(n-p  +  2)      \n  +  2  +  2L*      n-p  +  2 


(3.3.19) 


L*  ~  Pm33on  (^^2  022.1^2)  ^nd  V\L'  ~  5e<a  (l(p  +  2L*),  |(n  -  p)), 


The  proof  of  this  theorem  is  omitted  because  of  the  similarity  of  its  proof  to  that 
of  Theorem  3.2.3. 

Remark.  As  noted  in  the  remark  after  Theorem  3.2.3,  Ao,2  =  0,  i4j.,2  >  0  for  all 
/  =  1,  2, . . .,  and        >  0  for  all  a  6  (2,p  +  2),  while  Ao,a  <  0,  for  all  0  <  a  <  2. 
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3.4    Interval  Estimation  of  a' 

This  section  is  devoted  to  the  comparison  of  different  confidence  intervals  for  £t^ 
beised  on  the  hierarchical  models  given  in  Sections  3.2  and  3.3.  Using  S  alone,  the 
minimum  length  confidence  interval  for  with  confidence  coefficient  1  —  a  is  given 
by 

Ci(5)  =  {(7^  :  ci5  <a^<  C25},  (3.4.1) 
where  Ci  (  >  0  )  and  C2  (  >  Ci  )  are  obtained  from 

/„_p+4  [j]  =  /n-p+4  [j]  and  12  fn.p{x)dx  =  1  -  a,  (3.4.2) 

/„(i)  being  the  pdf  of  x^.  a  chi-sqaure  variable  with  i/  degrees  of  freedom  (Tate  and 
Klett,  1959;  Maatta  and  Casella,  1990).  On  the  other  hand,  the  shortest  unbiased 
confidence  interval  for  <t^  with  confidence  coefficient  1  —  a  is  given  by 

C,{S)  =  {<T^  :  diS  <a^  <  diS},  (3.4.3) 

where  di  {  >  0  )  and  d2  {  >  di  )  are  obtained  from 

/n-p+2  =  /r.-p+2  (J^)  and  J2  fn-p{x)dx  =  1  -  a.  (3.4.4) 

We  now  find  a  class  of  HPD  credible  intervals  for  under  the  hierarchical  priors 
considered  in  (I)  -  (III)  and  (I)'  -  (III)'.  We  shall  show  that  a  subclass  of  such  HPD 
credible  intervals  have  smaller  length  as  well  as  greater  coverage  probability  (in  the 
frequentist  sense)  than  the  intervals  described  in  (3.4.1)  and  (3.4.3). 

Consider  first  the  class  of  priors  given  in  (I)  -  (III).  Also,  we  confine  attention  to 
intervals  of  the  form 

C^{W,  S)  =  {c^  :  ar(l>{W)S  <      <  a2<i>{W)S},  (3.4.5) 
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where  oi  (  >  0  )  and  02  (  >  ai  )  are  arbitrary  constants,  while  0  <  4>{W)  <  I.  Recall 
that  W  =  T/S  and  T  =  0^X'^X0. 

Within  the  class  of  all  intervals  of  the  form  (3.4.5),  the  optimal  (i>o(W)  which 
leads  to  the  HPD  credible  interval  based  on  the  prior  given  in  (I)  -  (III)  is  given  by 
the  following  theorem. 

Theorem  3.4.1  Consider  the  class  of  priors  given  in  (I)  -  (III)  with  n-p  +  b-a  >  0 
and  b  <  p  +  2.  Then,  within  the  cIilss  of  intervals  of  the  form  (3.4.5),  the  optimal 
(j!)(iy)  is  given  by  Mw),  where  Mw)  is  a  solution  of 

Jn-p+b-a+4  I  "TT  ^   ^ p-b+2 


(i)o{w)ai  J  \</>o(iy)ai/ 

=     fn-p+b-a+A  (   ,    ,  \       1  Fp-b+2  (    .   ."^^       |  »  (3-4-6) 

where  f^{x)  denotes  the  pdf  of  xl,  ^.nd  Fu{x)  =  P{xl  <  x)- 

The  proof  is  omitted  because  of  its  similarity  to  the  proof  of  Theorem  2.3.1. 
Consider  the  special  case  b  =  a{<p  +  2).  If  we  now  set  oi  =  ci  and  03  =  ci, 
from  Theorem  3.4.1,  we  have  obtained  a  class  of  HPD  credible  intervals  of  the  form 

/^o  =       '■  (^iM'^)^  <<y^<  C2(i)o{w)s}  (3.4.7) 
for  (7^  where  (i>Q{w)  satisfies 

f      i    ^    \f      (  ^ 

/n-p+4     -T',  ; —  fp-a-\-2 


(i)o{w)ci)  \<f>o{w)ciJ 
i  .  ■        =   f,_^^J—^]Fp.^^J-^]  (3.4.8) 

\(pQ{w)C2j  \(po{w}C2) 

Using  arguments  similar  to  those  given  in  Theorem  2.3.2,  we  now  have  the  fol- 
lowing theorem. 

Theorem  3.4.2  For  an  interval  given  in  (3.4.7)  and  (3.4.8)  with  0  <  4>o{w)  <  I, 
L(I^)  <  L(Iml)  and  PA<y-  G  I^)  >  P^^ia^  e  Iml)  for  all  a\ 


To  improve  on  confidence  intervals  given  in  (3.4.3)  and  (3.4.4),  choose  b  =  a-2 
and  a  <  p  +  4  in  (3.4.6)  to  get  the  following  class  of  HPD  credible  intervals 

{a^  :  di<f)o{w)s  <a^  <  diMyJ^},  (3-4.9) 

for      where  <l>o  is  obtained  from 

1^:*) {^)  - i^i (^) 

Following  the  line  of  proof  of  Theorem  2.3.2,  it  can  be  shown  that  L{I^„)  < 
L{Isu),  and  P^2{(x^  e  I^)  >  P<,s(ct2  g  Isu)  for  all  a-,  where  denotes  the  HPD 
credible  interval  defined  in  (3.4.9)  and  (3.2.14)  and  Isu  denotes  the  shortest  unbiased 
confidence  interval  given  in  (3.4.3)  and  (3.4.4). 

Next,  we  improve  upon  confidence  intervals  of  the  forms  given  in  (3.4.1)  -  (3.4.4) 
using  the  class  of  priors  given  in  (I)'  -  (III)'.  Using  an  argument  analogous  to  that 
given  in  Theorem  3.4.1,  for  a  <  p2  +  2,  a  class  of  HPD  credible  intervals  of  the  form 

{(T^  :  Ci<i>o{w)s  <a^  <  C2<i>o{w)s}  (3.4.11) 

for  cr-  where  <t>o{w)  satisfies 

f      i    '    \f      (    ^  \ 


4>o{w)ci  J  \(po{w)Ci/ 
=    f„_^^J^-^]F^.,^J—^]  (3.4.12) 

is  obtained.  Following  arguments  similar  to  those  given  in  Theorem  2.3.2.  this  class 
of  HPD  credible  intervals  have  shorter  length  and  higher  probability  of  coverage  than 
the  minimum  length  confidence  interval  based  on  5  only  given  in  (3.4.1)  and  (3.4.2). 

Similarly,  for  a  <  p2  +  4,  we  have  a  class  of  HPD  credible  intervals  for  cr^  of  the 
form 

'  '  ^  {(T^  :  d^(i>o{w)s  <  a'^  <  d2<t>o{w)s}  (3.4.13) 
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for      where  (f>o{w)  satisfies 

(^)  =      {^) {^)  <'-^-"' 

which  dominate  the  shortest  unbiased  confidence  interval  based  on  S  only  given  in 
(3.4.3)  and  (3.4.4). 

3.5    Numerical  Results 

We  now  provide  some  numerical  calculations  of  the  percent  risk  improvement 
of  ^aa,i  «a,a.i  o^er  5/(n  -  p  +  2).  Figures  3.1  and  3.2  give  the  percent  risk 
improvements  for  n  =  10  and  p  =  3,  and  for  n  =  15  and  p  =  7,  respectively,  plotted 
against  t]  =  (2a^)"^  (3^X'^X(3  using  the  expression  given  in  Theorem  3.2.3.  Figures 
3.3  and  3.4  give  the  percent  risk  improvements  for  the  latter  combination  with  pi  =  2 
and  P2  =  5,  and  pi  =  5  and  p2  =  2,  respectively,  plotted  against  tj  using  the  expression 
given  in  Theorem  3.3.3.  When  a  =  2,  our  HB  estimator  is  the  Brewster- Zidek  (1974) 
estimator  in  the  special  case  of  only  one  cell  mean. 


59 


60 


61 


Fig  3.3.       Percent  Risk  Improvement  of  e^^i  over  for  n  =  15,  pi  =  2  and 

P2  =  5 


CHAPTER  4 
ESTIMATION  OF  THE  VARIANCE  RATIO 

4.1  Introduction 

This  chapter  considers  estimation  of  <y\jo\^  the  ratio  of  two  normal  variances 
based  on  independent  random  samples.  Under  any  quadratic  loss,  there  exists  a 
unique  best  multiple  of  the  ratio  of  the  sample  variances.  However,  when  the  popula- 
tion means  are  unknown,  such  an  estimator  has  been  found  to  be  inadmissible  under 
any  quadratic  loss.  Indeed,  Gelfand  and  Dey  (1988b)  have  found  Stein  type  testima- 
tors  which  dominate  the  best  multiple  estimator  mentioned  above.  Such  testimators, 
being  non-smooth,  are  themselves  inadmissible. 

In  Section  4.2,  two  classes  of  hierarchical  Bayes  estimators  of  a\  j a\  are  developed. 
The  two  classes  of  hierarchical  priors  are  extensions  of  the  ones  considered  in  Section 
2.2  of  this  dissertation  and  in  Ghosh  (1992b)  for  the  one-sample  case.  It  is  shown  in 
Section  4.3  that  certain  subclasses  of  these  HB  estimators  dominate  the  best  multiple 
estimator.  The  risk  dominance  results  are  proved  by  two-sample  extensions  of  certain 
general  results  of  Kubokawa  (1991)  in  the  one-sample  case.  Also,  exact  expressions 
for  the  risk  improvement  is  provided  for  one  of  the  two  classes  of  HB  estimators. 

4.2    Development  of  the  HB  Estimators 


Let  Fi,  (i  -  l,...,ni)  and  Yi,  {j  =  l,...,n2)  be  independent  random  samples 
from  N{mi,  cr^)  and  7V(m2,  af)  distributions  respectively.  The  minimal  sufficient 
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statistic  is  (Yi,  y'j,  5i,  52),  where  Yk  =  71-^2"^!^,  and  Sk  =  2r=i(^fc.  "  ^'  =  1, 2. 
Consider  estimation  oi-q  =  crlj (t\.  If  >  6,  then  under  the  usual  group  of  location- 
scale  transformations,  and  the  loss 

L(r,,  a)  =  f-  -  1)  ,  (4.2.1) 


the  best  equivariant  estimator  of  Tj  is 


Ui  +  1  ^2 

However,  it  is  shown  in  Gelfand  and  Dey  (1988b)  that  under  any  quadratic  loss 
Q{crl,  al)  L{t],  a),  where  Q{(tI,  cr|)  >  0,  the  best  equivariant  estimator  60  is  an 
inadmissible  estimator  of  rj,  and  is  dominated  by  both 

^  '  \°'     (n,+2)S2  / 

and 

(n2-4)5i  \ 


$2  =  mm  I 


(nx  +  i)Err22,;" 

Certain  generalizations  of  these  estimators  were  also  provided  by  these  authors.  It 
should  be  noted  that  the  estimators  6^  and  62  are  analogous  to  the  estimators  of 
Stein  (1964)  in  the  one-sample  case,  where  the  best  equivariant  estimator  of  the 
normal  variance  under  the  group  of  location-scale  transformations  was  shown  to  be 
inadmissible  by  a  class  of  testimators.  As  discussed  in  Section  4.1,  such  estimators 
being  non-smooth  are  themselves  inadmissible. 

We  now  develop  two  classes  of  HB  estimators  of  (rl/aj.  Let  =  R^^  {k  =  1,2). 
The  first  class  of  hierarchical  Bayes  priors  is  as  follows: 

(I)  Conditional  on  Mm  =  mk,  Rk  =     {k  =  1,2)  and  Ai  =  Ai,  fj,  Y2,  Si  and 
^2  are  mutually  independent  with 

n~Ar(mfc,  (nkrk)-')         and        5^  ~  r.-^Xn.-i,     ^  =  1,2; 
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(II)  Conditional  on  Rk  =      and  Ai  =  Ai,  Mi  and  M2  are  independent  with 
Ml  ~  i"V(0,  (AiTi)"^)        and        M2  ~  Uni  for  mi  — 00,00); 

(III)  Marginally,  Ri,  Ai  and  ilj  are  independent  with 

/Ri(ri)   oc   r"'"',  0<ai  <ni+2, 

/a.(Ai)   «    Ai"  =  '''(ni  +  Aa)-^(''-''^),  0  <  6i  <  3 

and 

fR,{r-2)    oc    rj .         0  <  a2  <  nj  +  1. 
Writing  U\  =  h-ij  (ni  +  Ai),  the  following  theorem  is  now  proved. 
Theorem  4.2.1  Under  the  model  given  in  (I)  -  (III),  one  has 

(i)  conditional  on  Yk  =  yk,  Sk  =  Sk,  Rk  =  rk  (k  =  1,2)  and  Ui  =  ui,  Mi 
and  M2  are  mutuadly  independent  with 

Ml    ~    iV((l-ui)yi,  (niri)-^(l-ui)) 

and 

M2   ~       (y2,  (nzT-j)-^)  ; 

(ii)  conditional  on  Yk  =  yk,  Sk  =  Sk  {k  =  1,2)  and  Ui  =  ui,  Ri  and  R2  are 
mutually  independent  with 

Ri    ~    Gamma(^^{si+uti),^{ni-ai  +  2)^,        =  myl 

and 

1  1 


R2    ~    Gamma  ^-53,  -(n2  -  02  +  l)j  ; 
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(Hi)  conditional  on  Yi  =  yi  and  S\  =  s\,  the  pdf  of  U\  is  given  by 

fu{ur\yx,s)  ~  4^'-''V  +  «it^ir'^"^"*'^^"-f(o.i)(«),  (4.2.3) 


w. 


here  wi  =  ti/si. 


Proof.       Based  on  (I)  -  (III),  the  joint  distribution  is  given  by 
fiVi,  yi,  Si,  32,  mi,  mj,  ri,  r2,  Aj) 


oc 


n{ 


'  k  ^ 


e    2  5. 


} 


X   (V,)'"e-V"!n{r-'-)A-'''(n,  +  A,)-*'-'-'  (4.2.4) 

fc=l   ^  ^ 

Integrating  (4.2.4)  with  respect  to  mi  and  m2,  one  has 


/(yi,  y2,  Si,  32,  rirj,  Ai) 


^  /  Aj 


"1  +  Ai 


(ni  + Ai) 


-2 


X  eip 


^1  J  ^1      -2  , 


2  [  'ni  +  Ai 
Then,  from  (4.2.5),  we  have 


n  Gf^)  (4.2.5) 


/(yi,  •si,  ri,  Ai)  a  rj 


"1 

3 


Ai 


Til  +  Ai 


X  eip 


^1  I  ^1      -2  , 

"1 — — ^yr  +  •51 


2  1    m  +  Ai' 


Si      (ni  +  Ai) 


-2 


and 


/(y2,  52,  r-j)  oc  =      e"   5    5,  ' 


(4.2.6) 


(4.2.7) 
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Putting  Ui  =  )^i/{ni  +  Aj  in  (4.2.6),  we  have 


fiVi,       ri,  ui)  a      '    u^'  exp 


^1   f  -2  , 


r■^  r        _2        .  ]  nip. 


1 ' 


Now,  we  have  from  (4.2.8), 

/  (ri,  til  yi,  si)  oc  exp 
ajid,  from  (4.2.7), 

/  (^2|-S2)  OC  e~^r-2 
This  completes  the  proof  of  the  theorem. 

Using  the  loss  given  in  (4.2.1)  the  HB  estimator  of  rj  =        Ri  is 


—07 — I 


^Oi  ,61 ,02 

From  (ii)  of  Theorem  4.2.1,  one  has 


E 

{Ri\Yu 

50 

E 

(R2' 

Y2, 

S2) 

E 

(Ri\yu 

s.) 

E 

(R2' 

Y2, 

52) 

E  (R^' 

Y2,  S2) 

E  {R^' 

Y2,  S2) 

(4.2.8) 


(4.2.9) 


(4.2.10) 


(4.2.11) 


(4.2.12) 


E{Ri 

Vi,  s,) 

5i  E 

Yu  Sy 

e[r\ 

Yu  5i) 

ni  -  ai  +  4  E 

(l  +  C/iV^i)-2 

Yu  Sy 

Si     /o^4^'''"'(i  +  uivrx)-?("^-'"^-"(/ui 


(4.2.13) 

Combining  (4.2.12)  and  f4.2.13),  the  HB  estimator  given  in  (4.2.11)  of  r?  simplifies  to 
^  ^       ^   5i  (n2  -  a2  -  3)  /o^  4^''''^'(l  +  uiH^i)-;<"^-'^+-»dm 


=     ^  <^a,,i,i.aj(H^l) 


(4.2.14) 
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The  second  class  of  hierarchical  priors  is  as  follows: 

(I)'  Conditional  on  Mk  =  ruk,  Rk  =  '^fc      =  1,2)  and  A2  =  A2,  Yi,  V2,  5*1  and 
52  are  mutually  independent  with 

Yk  ~  jV  (rrik,  (nfcrfc)"^)         and        Sk  ~  '"fc^Xn^-i.      ^  =  1,2; 

(II)  '  Conditional  on  Rk  =  rk  and  A2  =  A2,  Mi  and  M2  are  independent  with 

Ml  ~  Umform{- 00,  00)        and        M-i  ~  iV(0,  (Ajrj)"^) ; 

(III)  '  Marginally,  Ri,  R2  and  A2  are  independent  with 

/Ri(n)   oc   Ti'"*,  0  <  ai  <  ni  +  1, 

/h,(''2)    oc    rj^"*',  0  <  a2  <  712  +  2, 

and 

/a,(A2)   oc   Aj^'^(n2  + A2)-^<'-''>  0<62<3. 

Let  U,  =  Then  /tr,(u2)  oc  u," 

Based  on  the  above  class  of  hierarchical  priors,  the  following  theorem  is  now  obtained. 

Theorem  4.2.2  Under  the  model  given  in  (I)'  -  (III)',  one  has 

(i/  conditional  on  Yk  =  yk,  Sk  =  Sk,  Rk  =  rk  {k  =  1,2)  ancf  U2  =  u^,  Mi 
and  M2  are  mutually  independent  with 

Ml    ~    N{yu  [nin)-')  ■ 

and 

M2    -    A^((l -U2)y2,  (n2r2)-'(l -U2))  ; 
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(ii)'  conditional  on  Yk  =  t/fc,  Sk  =  Sk  {k  =  1,2)  and  U2  =  uj,  -^i  and  R2 
are  mutually  independent  with 

11 


and 


Ri    ~    Gamma  (^-Si,  -(ni  —  ai  +  l)j  , 


il2    ~    Gamma  (  -(^2  +  ^^2),  ^(^2  — 


(iii/  conditional  on  Y2  =  ^2  and  52  =  52,  tie  pdf  of  i72  is  given  by  * 
■     /a(u2|y2, 5)  ~  4^'"'^ V  +  «2u;2)-'^"'-'''+'^/(o.i)("),  (4.2.15) 
where  W2  =  t2f  32-  .■  ' 

The  proof  is  omitted  because  of  its  similarity  to  the  proof  of  the  previous  theorem. 
Now,  from  Theorem  4.2.2,  the  Bayes  estimator  of  77  under  the  loss  (4.2.1)  reduces 
on  simplification  to  - 


J/ai  ,02,63  — 


E(Rr 

E  {R2' 

Y2,  S2) 

E  (ri 

E  {R2' 

Y2,  S2) 

£j(l  +  C/2W^2|V'2,  ^2) 


51  ("2  ~  ^2  —  2) 

52  ("1  -  ai  +  3)' 

51  (722  -  a2  -  2)      ul^'-''\\  +  U2H^2)-?<"^-'")rfn2 

52  (ni  -  ax  +  3)  jji  4(i-^)(i  +  ^^p^,^)-i(n,-a,-2)^^^  • 

5i 


=  4>ai,ai,h,{W2) 
02 


(4.2.16) 

In  the  next  section,  we  derive  frequentist  properties  of  the  HB  estimators  1701,61,02 
for  certain  choices  of  (ai,6i,a2)  and  of  the  HB  estimators  ^oi,aj,6s  for  certain  choices 
(01,02,62). 
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4.3    Risk  Dominance  over 

The  main  focus  of  this  section  is  to  investigate  conditions  under  which  some 
of  the  HB  estimators  developed  in  the  previous  section  dominate  60  =  (Si / S2) 
(„2  -  5)  /  (ni  +  1),  the  best  multipleof  5i / 52  for  estimating  rj  =  aj/al.  Two  general 
results  are  derived  in  this  section.  The  first  of  these  results  provides  the  basis  for 
determining  ai,  61  and  so  that  the  resulting  rja^MA,  smaller  risk  than  that  of 
80  under  squared  error  (  and  hence  any  quadratic  )  loss.  The  second  result  provides 
a  similar  basis  for  determining  aj,  aj  and  62  so  that  the  resulting  ^ai.a2.6,  has  smaller 
risk  than  that  of  ^0  under  squared  error  loss. 

In  order  to  prove  the  first  result,  consider  a  class  of  estimators  Si{Si,  S2,Wi)  = 
{S1/S2)  hC^i)  for  estimating  t/.  Denote  by  /^(i;  r)  and  F^ix;  t)  the  pdf  and  the 
distribution  function  respectively  of  a  chi-square  with  u  degrees  of  freedom  and  non- 
centrality  parameter  r.  Also,  to  simplify  notation,  write  f^{x;  0)  and  0)  as 

f^{x)  and  Fu{x)  respectively. 

The  first  theorem  of  this  section  is  as  follows.  ^  ; 


Theorem  4.3.1  Suppose  n2  >  6, 

(a)  <f>i{wi)  is  t  in  wi,  and  '  "  ^ 

(b)  \\mM^^)  =  ^^■  '  '  ^  ' /  '  V 
Then  ^i(5i,  52,  Wi)  dominates  60  under  the  loss  (4.2.1),  i.e., 

for  all  mi,  al  and  a\  if 

(n2-5)  /o-^l  +  zQ-^zr'^^i 


(i>i{wi)  > 
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=   Mwi)  (say). 


Proof.       Write  n  =  \nim\/(Tl.  Then, 


foo  roo  toot  X\  1  ^ 

=    /    /    /   i</'i(ifi)  If  Xi/„i_i(xi)/„,_i(x2)/i(iyia;i;ri)<iiidx2<it^i  • 

70   70  I  12  J 

(4.3.1) 


Thus, 


i^iCtyi)  U  /ni-i(a:i)/„3_i(i2)-P'i(ifia;i;ri)(ixi(ix2 

Xj  -> 


/       "5          /      /     {(i>\{W\)  W  /nt-l(xi)/„j-l(x2)Fi(tOiXi;Ti)dxi(fx2 

Jo    \_UW\  Jo    Jo     K  X2  ) 


Xi 

<f>l{wi)  1>  Xifni-l{Xi)fn,.i{x2)fl{wiXi\Ti)dXidx2dWi 

X2  J 


dwi 


2  \  (f>i{wi)  1  >  <f>[{wi)—f„^-i{xi)f„,.i{x2)Fi{wiXi-Ti)dxidx2dwi 

K  X2  )  X2 


roo  roo  roof  Xj  "1  ^ 

=      /     /     /     {<f)l{Wi)  l\  Xifr,^-i{Xi)fr,,-iix2)fl{wiXi\Ti)dXidx2dwi 

Jo  Jo  Jo    K  X2  ) 

f°°  f°°  f°°     (  Xj  "I  Xi 

+  /  'i>'\{wi)  I    /   2\(i>x{w-i,)  U  — /ni-i(xi)/„j_i(x2)Fi(iyiXi;Ti)(ixi(ix2<iioi, 

JO  Jo    JO        K  X2  )  X2 

■  '        .       ■  -  •  r-  (4.3.2) 

so  that 

v 

Em.,.l.l  [8o/-n  -  if  -  F„,..^.i  (^i/r?  -  l)'     "     '^^      '  " 
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2  {  M^i)  1  \  —fni-\{Xl)fni--Lix2)Fi{WiXi;Ti)dXidX2 

(4.3.3) 

Since  (t>'i{wi)  >  0,  the  right  h<ind  side  of  (4.3.3)  is  >  0  if  and  only  if 

^Jn,-i{xi)fn,-i{x2)Fr{wrXr;T,)dxidx2 


4>iiwi)  > 


Io°  Io°  ^fni-l{Xl)fn:,-l{X2)Fi{WiXi;Ti)dXidX2 

fo°  a:i/„i-i(xi)Fi(u;iii;  Ti)  |/o°°     Vn2-i(a;2)«^X2}  dxi 

Io°  xlfni-l{Xl)Fi{WiXi;  Ti)  |/o°°  X2^ fn,_i{x2)dx2]  dXi 

^  (n;  -  3)"^  /o°°ii/ni-i(ii)Fi(u;iii;ri)(fii 

((nj  -  3)(n2  -  5))-i  /o~  x?/„i-i(ii)Fi(ti;ia;i;  Ti)(ia:i 

_    ,  Xif„^_i{xi)Fi{wiXi;Ti)dxi  1- 

^  /o°°a;i/ni-i(xi)i^i(t^ia:i;Ti)<iii 

=    (nj  -  5)Er,{Vf^)  for  all  n  >  0, 


(4.3.4) 


where  the  pdf  of  Xi  is  given  by 

/iri(xi)  oc  xlfr^,-i{xi)Fi{wiXi;Ti) .  (4.3.5) 

For  Tj"  >  r{, 

^r"(a;i)  Fi{wiXi;t{') 

T~7 — \  °^  "cw  77      T  in  lyixi,  i.e  in  ij. 

/ir'(a;i)  Fi(u;iXi;ti') 

Therefore,  E{X^^)  is  J.  in  Ti  so  that  (4.3.4)  holds  for  all  n  >  0  if  and  only  if 

Xifni-l{Xi)Fi{WiXi)dXi 


Io°  xlfni-i{xi)Fi{wiXi)dxi 


73 


' -        !^ x\u,-,{x,) h{x,z,)dz,dx, 

"  !^       x\U.,{x,)  h{x,z,)dz,dx, 

=    (n2— oj  ni-i  — ~  ' 

!-^z\-'(j^e-'^('^^^xfdx,)dz, 

=  ("2-5) — T-rn — ^z±i  ; —  ' 

(n2-5)  /-^(l  +  zi)-^z^'jzi 
("1 +2)  + 

=   Mt^i)-  (4-3.6) 

We  now  investigate  conditions  under  which  i7^^i,aj  dominates  Sq  under  the  loss 
(4.2.1).  Specifically,  we  have  the  following  theorem. 

Tiieorem  4.3.2  Suppose  >  6.  If  =  2  and  2  <  aj  =  6i  <  3,  then  the  HB 
estimator  7;^^^  „j  dominates     under  the  loss  (4.2.1). 

Proof.       From  (4.2.14),  putting  02  =  2, 

51  (712  -  5)  ul^'-'"\l  +  tXityO-i("^-°'+-')dui 

'S'2(ni  -ai  +  4)ja^§(i-0^j^^^P^^^_i(„,_a,+6)^^^ 
_     01      ("2  -  5 J      Jo  \i+^,wj  U+mtVi/  (H-u^^y^)^ ""1 

52  (ni  -  ai  +  4)  ,1  /  u.w^,  /     1  ly,  J 

JO  Vl+uiVrJ  \l+uiWj  * 
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51  {n,-b)     /o^-/(^-^^-'x^(^-°''(l-x)-^<"--^Wx  ' 

52  (ni  -  ai  +  4)  x5(i-'»(l  -  x)-5("^+i)(iui 


5a  (n2  -  5)  P 

5e<a  (i(3  - 

\{n^  +  l))  < 

(l+«'l)J 

52(ni  +  l)p 

Beta  (i(3  - 

ai), 

Km +  3))  < 

(l+«'l)J 

Now,  using  (4.3.6),  one  has 


(4.3.7) 


Then, 


5i  (n2  -  5)  P 

Beta  (i. 

i(ni  +  l)) 

< 

l+Wl 

52  (ni  +  l)p 

linx  +  3)) 

< 

toi 
l+u;i 

5l  (712  -  5)  J" 

Beta  (i. 

|(m  +  l)) 

< 

1+tux 

52  (ni  +  l)p 

Beta  (i, 

|(ni  +  3)) 

< 

Wl 
1+tui  . 

=  '72.2,2 

<  ■  " 

'lai,ai,2 

Beta  (i(3 

-ai),  |(7ii  +  1))  < 

toi 

'72,2,2  P 

Beta  (i(3 

-ai),  Krti+S))  < 

Ull 

l+tui  J 

p 

Beta(i,  |(7ii  +  1))  < 

l+u/i  . 

p 

Be<a(i,  |(na  +  3))  < 

vLV  (4.3.8) 


>  1 


(4.3.9) 


by  appealing  to  Lemma  2.1.1.  To  prove  that  7701,02,2  dominates  6q,  it  now  suffices  to 
verify  conditions  (a)  and  (b)  of  Theorem  4.3.1. 
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To  verify  condition  (a),  note  that  Vi  =  Wi/{1  +  Wi),  so,  as  Wi  oo,  Vi  ^  1. 
Then,  for  02  =  2  and  ai  =  ii  <  3,  one  has  from  (4.3.7), 

n2  -  5 


lim  <pi{wi)  = 
wi->oo  nj  -(-  1 


(4.3.10) 


To  show  that  4>i(wi)  is  t  in  Wi,  define  a  new  pdf, 

^(z)ocz5(l-z)i("-^V.,(^)-     .  (4-3.11) 

Then,  for  0  <  ui  <  v[,  f^>^{z) / f^,{z)  oc  I^o.v[){z) / I(o,vi){z)  T  in  z.  Note  that 
4>\iw\)  <x  E{1  —  Z)""',  where  expectation  is  taken  with  respect  to  the  pdf  given  in 
(4.3.11).  Hence,  <i>i(wi)  is  t  in  vi  and,  hence  in  wi  (see  Lemma  2(i),  p. 85,  Lehmann, 
1986). 

Before  deriving  an  expression  for  the  risk  difference  of  Sq  and  77^^  j)  note  that 


V^^^  1  can  be  written  as 

K  -  5)     Jo^^/(^^^^)x?(^-''0(l  -  x}-^(-^-')du^ 
S2  (ni  -  ai  +  4)  x^(i-oi)(l  -  x)5K+i)(iui 

51  {n,  -  5)     /o^-  x'^^'-^^\l  -  x)^("^-^)ciu^      ■  , 

52  (ni  -  ai  +  4)  /j,^^  x5(i-»')(l  -  x)5("'+^)dui  ' 


■Si     (nz  -  5) 
S2  {ni  -  ai  +  4) 


51  (n2  -  5) 

52  (ni  -  ai  +  4) 


1  + 


1  + 


/o^^x?(^-°^)(l-x)f("^-^)^m 

/0^^X5(^-»^»(1  -X)5("^  +  I)dlil 


Vi 


3  -  oi  2 
ni  +  1      ni  +  1  /J^i  a;5(i-»i)(l  -x)5<"i+i)(iui 


(n2  -  5)  5i 
(ni  +  1)52 


(l-<^a.(V'l)) 


(4.3.12) 


where 


K{Vi)  = 


(4.3.13) 
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Now,  an  expression  for  the  risk  difference  of  60  and  r]"^%^^2  obtained  using  the 
following  theorem.  :  , 

TJieorem  4.3.3  Consider  the  model  under  which  Yi  and  Si  are  mutually  independent 
with  Yi  ~  N{mi,  nf^cr^)  and  5*1  ~  c^iXni-i  ^2  and  ^2  are  mutually  independent 
with  Y2  ~  iV(m2,      V|)  and  ^2  ~  (^Ixl^-n  where      >  6.  Then  for  2  <  oi  <  3, 


(^0.--l)^-«1.2.-^-l)^ 


=  E 


{m  +  2Ii)(ni  +  2Li  +  2)i"'  1\al,,,, 


(n2-3)- 


(4.3.14) 


where 

>lr,i.oi    =  E 


{2Li+ai-2)<f>,,{Vi){l-Vi) 


2         _  1-Vi 
(ni  -  ai  +  4)(ni  +  1)        Ui  +  2  +  2Li     ni  + 


(4.3.15) 


where  Z-i  ~  Pot55on  *°<^  ^il-^i  ~ -^^^o  (^(^  +         Kj^i  -  1)). 

Proof.       Let  Ti  =  niFj^  and  Gi  =  Ti  +  5i  so  that 

'     1  +  H^     l+?\/5i  Gi" 
Then  ^0  =  GiMVi)/S2  and  e^^  =  GiVa,(K)/52  where  V.(l^i)  =  ^^^^ 
V'ax(Vi)  =  ((na  -  5)(1  -  Vi){l  -  <f>a^{Vi))/{ni  +  1).  For  an  estimator  GMVi) / S2  of 
T],  one  has 


=  E 


mi/<ri,l.l 


(GiV(Ki)/52  7,-^-1)' 


=  E. 


'mi/<ri,l,l 


(GiV'(V^i)/52-1)'1  (4.3.16) 


If  we  write  Ti  =  (2<Tj)~*nimJ,  then,  under  the  reparametrization  (mi/<Ti,  1, 7712/(72, 1 ), 
5i  and  Ti  are  independently  distributed  with  Si  ~  Xni-i  and  Ti  ~  Xi('''i)-  Introduc- 
ing the  dummy  variable  Li  ~  Potsson{Ti),  Vi  and  Gi  are  independent  conditional 


on  Li  with  Kill  ~  Beta  (|(1  +  2Li),  \{ni  -  1))  and  ~  x 


2 

ni+2i/i ' 


Next  observe  that 


77 


mi  /<Ti  ,1,1 


(Git/>(Fi)/52-l)' 


EE, 


m\l<T\  ,1,1 


(GiV'(Fi)/52-1)'|Li 


2(ni  +  2L,)  Emu.,xi  {^{Vr)/S2\Lr}  +  1 


=    £;[(ni +  2Li)(ni +  2Ii +2)  X 


'■'mi  /(Ti  ,1,1 


52        ni  +  2Li+2^ 


i^i  + 


ni  +  2Ii  +  2 


(4.3.17) 


Then, 

Emi/aul,l 


V.(V^i) 


1 


>ai(V^l) 


1 


52        ni+2Li+2/       V    ^2         ni  +  2Li+2, 


E, 


"»l/(7-l,l,l 


{i^lW  -  <(^i))'  A'  -2{MVi)  -  MVi))/S2\Lr 


—  -£'mi/<Ti,l,l 


mi/<ri,l,l 


(n2-5)(l-yi) 
(ni  +  1)52 


Km 


=  E 


mi/<7-i,l 


ni  + 


E, 


n2  -  5 
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—  2  Emi/<ri,l 


.(ni  +  1) 


n2  —  3 


712—5 


(ni  +  l)  V("i  +  1) 


(4.3.18) 


Now,  using  arguments  similar  to  those  given  in  (3.2.15)  -  (3.2.18),  we  have  the 


It  is  shown  in  Ghosh  (1992b)  that  Ao,2  =  0,  Ai,,2  >  0  for  all  /  =  1, 2, . . .,  Ai,,ai  >  0 
for  all  ci  e  [2, 3)  and  Ao^a^  <  0  for  0  <  ai  <  2. 

The  next  theorem  is  aimed  at  proving  the  risk  dominance  of  Vaua^.b^ 
for  certain  choices  of  ai,  02  and  62.  The  theorem  is  stated  below.  Recall  that  /„(x) 
denotes  the  pdf  of  xl  while  /„(x;  r)  denotes  the  pdf  of  xt{''')i  ^  non-central  chi-square 
with  u  degrees  of  freedom  and  non-centrality  parameter  r. 

Theorem  4.3.4  Suppose  n2  >  6, 
(a)  (f>2iw2)  is  I  in  11)2,  and 


theorem. 


(b)  lim  (f>2{'W2)  = 

w—>oo 


Then  62(81,82,  W2)  =  8i4>2{W2) / 82  dominates  60  under  the  loss  (4.2.1 ),  i.e., 
Em,,.l.l  {62/v  -  1)'  <  [So/v  -  if 


for  all  7712,  ^1  and  cr^  if 


M^2)  < 


(^2-4)  /o"'(l  +  Z2)-^4"'^^2 


=    (f>2o(w2)  (sayj. 


Proof.       Write     =  in2m^/(T|.  Then, 


too  foo  root               X\  \^ 
=      11  {4>2{W2)  If    a;i/ni-l(a;i)/n2-l(x2)/2(iy2a;2;T2)(iXidl2tit«2  . 

Jo  Jo  Jo   K  X2  ) 

(4.3.19) 


Thus, 


h{w2)  1 

X2 


I  fn^-l{Xl)fn2-l{^2)F2{w2X2\T2)dx-i,dx2 


f«>l  d 

Jo  I  dw2 


mXi  "I  ^ 

<^2(ty2)—  -  1|  fni-l{xi)fni-l{x2)F2{w2X2;T2)dXidX'2 


dwi 


=   l[  X\fni-l{Xl)fni-\ix2)f2{w2X2;T2)dxidx2dw2 

Jo  Jo  Jo    K  X2  ) 


m°°     (                 Xi  1  X\ 

2  <  (?!>2(l02)  1  \  <^2("'2)  — /ni-l(a!l)/n2-l(x2)F2(tW2X2;  T2)dxidx2dw2 
K               X2  )  X2 


/•oo  roo  roof  "1  ^ 

=    /    /    /   i<^2(t^2)  If  a;i/„j_i(ii)/„j_i(x2)/2(it'2a;2;T2)(ixiC^i2ciix;2 

70  Jo  Jo    K  X2  ) 


+ 


f°°  f°°  f°°    (  Xi  )  Xi 

/     "^2(^2)/     /     2U2(U^2)  l\  —fn,-liXi)fn,-l{x2)F2{w2X2]T2)dXydx2dw2, 

Jo  Jo   Jo        K  X2  )  X2 

(4.3.20) 


so  that 
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2  {  (;i'2(w2)  1  \  —fni-l{Xl)fn2-l{^2}F2{w2X2;T2)dXidx2 

K  X2         )  X2 


(4.3.21) 


Since  4>2{w2)  <  0,  the  right  hand  side  of  (4.3.21)  is  >  0  if  and  only  if 

Io°  fo°  ^fn,-l{xi)fn:i-lix2)F2{w2X2;T2)dXidx2 


(f>2{w2)  < 


So°  Sq°  -^fni-l{xi)fni-l{x2)F2{w2X2;T2)dxidx2 


/o°°  ^2    fn2-l{x2)F2{w2X2]  T;)  {/o°°  Xifn^-l{Xi)dxi}  dx2 
^2^frx2-l{X2)F2{w2X2;T2){J^  X  J/„j  _1  (Xl  )<iXl  }  rfxj 


(^1-1)  /o°°^2  fn2-l(^2)F2{w2X2\T2)dx2 

(ni  -  l)(ni  +  ^)  X2'^fn^^i{x2)F2{w2X2;T2)dX2 

1  /o°°  a^2"Vn;-l(x2)-^2(t"2a:2;  T2)dX2 
"1  +  1  Io°  ^2^ fn2-l{X2)F2{w2X2]T2)dX2 

1 


Til  + 

where  the  pdf  of  X2  is  given  by 


jEr,{X2)  (say), 


hr,{x2)  OC  12  Vn2-l(a;2)-f'2(t^2X2;  T2)  . 


For  r"  >  r' 


hri'{X2)  F2{w2X2\t!^) 
—2-   (X  — 


t  in  W2X2,  i.e  in  X2. 


(4.3.22) 


(4.3.23) 


K!,{X2)  F2{W2X2\T2) 

Therefore,  E{X2)  is  t  in  rj  so  that  (4.3.22)  holds  for  all      >  0  if  and  only  if 


4>2{'W2)  < 


1         Jo"  ^2^  fn2-\{x2)F2{w2X2)dx2 


"1  +  1   !^  ^2^ fn^-\{x2)F2{w2X2)dx2 


1         /o°°  ^2  Vn.-l(x2)  {Sr^  f2{y)dy)dx, 

"1  +  1  /o°°2;2-Vn.-i(x2)  {ir'f2{y)dy)dx2 
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1  /o°°  Jo"  /n;-l(j2)  f2{x2Z2)dz2dx2 

"1  +  1   /o°°  IP  ^2^  Ui-liXi)  f2{x2Z2}dz2dx2 

1        /o°°  IP  e"^3^2  '  e--^{x2Z2)i~'^dz2dx2 
Jo  Jo         e   2  ^2        e     2  (X2Z2)i  '■dz2dx2 


1      /o""  4"'  (/o~  e-'^('^'^)xT^dx2)  dz2 


n,  +  1  ^u.         (^^oc  e-^(i+^.)xj^dx2)  dz2 


1    r  4"'(H^)-'^r(=^Mz2 


"2-4  /o"''(l  +  Z2)  ^z]  ^dz2 


=  hoM-  (4.3.24) 

We  now  find  conditions  under  which  f}ai%,b2  dominates     under  the  loss  (4.2.1). 

Theorem  4.3.5  If  n2  >  6,  ai  =  2  and  2  <  02  =  62  <  3,  tien  the  HB  estimator  t}^^  u 
dominates  60  under  the  loss  (4.2.1).  •  .  , 

Proof.       From  (4.2.16),  putting  ai  =  2,  * 

,  ^    5i(n2-a2-2)  /o^4^'"'"V+tX2Vr2)-?("^-<''W2  . 

S2     (ni  +  1)     J^ul^'-''"\l  +  u,W2)-'^^-^-'-')du2  ■      ;  . 

'\        (n2  -  a2  -  2)  /o^  (t^)  ^^'""^  (1  +  U2P^2)^^"--^^ni£fa^^2 

^^^^'^  /o(iTt^)'^""'\l+"3Vr2)i(-^)^^.„2 

51  (712  -  a2  -  2)  /o^'x?(^-''')(l  -x)5("'-^)^x        :  ,  •  .  :  .;  ^ 

52  (ni  +  1)      j;^  x5(i-»=)(l  -  x)5("=-')(ii  •  ^  : 
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5i  (n2  -  5)  P 

Beta  (i(3  - 

a2),  |(n2-3))  <t>2 

52  (ni  +  l)p 

Beta  (i(3  - 

a2),  |(n2-5))  <i;2 

Now,  using  (4.3.24),  one  has 


Then, 


7r'^2o(u^2) 

02 


^2 


02,02 


V2,2,2 


S2  (ni  +  1)  /-'/(^+-')  _ 

Si  {712  -  4)  /o"'  X-2-\l  -  X)?("^-'>JX 
5^("1  +  1)  /o  =  XJ-^I  -  x)5("'-^)(ix 


5i  (n2  -  5)  ^ 

Se<a(i,  |(n2-3))  <  V2 

52(ni  +  l)p 

Beta  {I,  |(n2-5))  < 

=  ^2.2.2 


p 
p 


Befa  (i(3  -  aj),  i(n2  -  3))  <  V2 


Seta  (1(3 -aj),  |(n2  -  5))  < 


Beta  (i,  i(n2  -  3))  <  V2 


Beta  (I  i(n2  -  5))  <  V2 


(4.3.25) 


(4.3.26) 


>  1 


(4.3.27) 


by  appealing  to  Lemma  2.1.1  again.  To  prove  that  ^2,02,02  dominates  60,  it  now  suffices 
to  verify  conditions  (a)  and  (b)  of  Theorem  4.3.4. 

To  verify  condition  (a),  note  that  V2  =  1^2/(1  +  ^^2),  so,  as  W2  — >•  00,  U2  — *  1- 
Then,  for  ci  =  2  and  2  <  02  =  ^2  <  3,  one  has  from  (4.3.25), 

n2  —  5 


lim  4>2(w2)  = 

w-2  — »oo 


ni  +  1 


(4.3.28) 
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To  show  that  ^2(1^2)  is  i  in  W2,  define  a  new  pdf, 

/„,(z)oc2'(l-2)'("=-%,„,)(z).  -  (4.3.29) 

<        r  "i 

Then,  for  0  <  t;2  <  v'2,  /„/ <x  /(o,«i)(^)//(o,«j)(2)  T  in  z.  Since,  4>2{w2)  « 
EiZy^^,  where  expectation  is  being  taken  with  respect  to  the  pdf  given  in  (4.3.29), 
it  is  J,  in  V2  and,  hence  in  W2  (see  Lemma  2(i),  p. 85,  Lehmann,  1986). 

We  now  provide  some  numerical  calculations  of  the  risk  improvement  of  Tja^fi^^ai 
over  ^0  for  certain  values  of  rii  and  712-  For  =  6,  Figures  4.1  and  4.2  give  the 
percent  risk  improvement  plotted  against  ti  =  {2a^)~^niml  for  ni  =  3  and  ni  =  7 
respectively.  The  figures  show  that  the  risk  improvement  for  values  of  Ti  close  to  zero 
can  be  quite  substantial  even  for  such  small  values  of  ni  and  712. 

The  HPD  credible  intervals  for  rj  could  not  be  provided  as  they  were  done  in 
Chapters  2  and  3,  since  our  method  of  proof,  which  used  the  log-concavity  property 
of  the  chi-square  distribution  function,  could  not  be  carried  through  in  this  case  as  the 
distribution  function  of  an  F  distribution  does  not  have  this  property.  Nagata  (1989) 
provides  expressions  for  confidence  intervals  of  rj  based  on  Stein-type  testimators. 


Fig  4.1. 


Percent  Risk  Improvement  of  77a,a,i  over     for  ni  =  3  and      =  6 


85, 


1.5 


1.0 


0.5 


c 


0.0 


-0.5 


■1.0 


 a -2.5 


\'  ill 


-1.5  '  '  1  1  ^  \  1  L_ 

0       10      20      30      40      50  60 


Fig  4.2.      Percent  Risk  Improvement  of  t^^.^.i  over  60  for      =  7  and      =  6 


CHAPTERS  ; 
SUMMARY  AND  FUTURE  RESEARCH 


In  the  analysis  of  linear  models,  point  and  interval  estimation  of  the  error  variance 
play  an  important  role.  For  the  fixed-effects  balanced  one-way  ANOVA  model,  we 
obtained  in  Chapter  2  two  classes  of  hierarchical  Bayes  estimators  of  the  variance, 
subclasses  of  which  were  shown  to  dominate  the  best  multiple  estimator  of  the  error 
sum  of  squares  under  the  entropy  loss.  The  numerical  calculations  that  were  provided 
in  Chapter  2  showed  that  the  risk  improvement  could  be  quite  substantial  at  times. 
Also,  our  numerical  calculations  indicate  that  every  member  of  these  two  subclasses 
of  hierarchical  Bayes  estimators  is  admissible.  Such  an  admissibility  study  will  be  an 
interesting  theoretical  topic  for  future  research. 

In  Chapter  3,  hierarchical  Bayes  models  similar  to  those  proposed  in  Chapter  2 
were  considered  for  two  nested  regression  models  to  obtain  estimators  which  again 
dominated  the  best  multiple  estimator  under  any  quadratic  loss.  In  both  Chapters 
2  and  3,  highest  posterior  density  (HPD)  credible  intervals  were  provided  for  the 
error  variance.  A  class  of  these  HPD  intervals  not  only  had  shorter  length  but 
also  had  higher  probability  of  coverage  than  the  usual  minimum  length  and  shortest 
unbiased  confidence  intervals.  In  Chapter  4,  two-sample  extensions  of  the  hierarchical 
Bayes  models  proposed  in  Chapter  2  were  used  to  obtain  two  classes  of  hierarchical 
Bayes  estimators  of  the  variance  ratio.  A  subclass  of  these  estimators  was  shown  to 
dominate  the  best  multiple  estimator  of  the  ratio  of  the  sample  variances.     .  ..  • 

An  important  topic  of  future  research  will  be  to  study  the  performance  of  the 
proposed  HB  estimators  of  the  error  variance  in  one-way  random-effects  models.  In 
addition,  for  such  models,  estimation  of  the  treatment  variance  as  well  as  the  ratio  of 
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the  treatment  variance  to  the  error  variance  are  of  interest.  Klotz,  Milton  and  Zacks 
(1969)  developed  Stein-type  estimators  which  dominated  the  corresponding  ANOVA 
estimators  of  the  variance  components,  but  such  estimators  being  non-smooth  cannot 
be  Bayes  with  respect  to  any  prior  under  quadratic  loss.  Portnoy  (1971)  proposed 
certain  hierarchical  Bayes  estimators  but  could  not  prove  any  analytical  dominance  of 
such  estimators  over  the  usual  estimators.  It  may  be  possible  to  extend  our  theoretical 
findings  to  the  estimation  of  the  variance  components  or  the  variance  ratio  in  one-way 
random-effects  ANOVA  models. 

A  multivariate  extension  of  our  results  to  estimation  of  the  generalized  error  vari- 
ance in  multivariate  one-way  analysis  of  variance  models  is  also  of  interest.  Shorrock 
and  Zidek  (1976),  Sinha  (1976),  and  Sinha  and  Ghosh  (1987)  proved  inadmissibility 
of  the  best  equivariant  estimator  of  the  generalized  variance  under  a  variety  of  losses. 
Development  of  estimators  of  the  generalized  variance  in  a  multivariate  setting  us- 
ing a  hierarchical  Bayes  model  remains  an  open  question,  and  is  worthy  of  future 
exploration.  ,    ,  . 
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